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EXECUTIVE  SUMMARY 


The  outcone  of  araed  conflict  always  depends  on  the  decisions  made  by  the 
participants  as  the  conflict  unfolds*  As  part  of  Its  program  In  aatheaatlcs, 
the  Office  of  Naval  Research  (ONR)  has  sponsored  work  to  develop  novel  models 
of  decisionmaking  during  conflict.  In  order  to  broaden  understanding  of  the 
factors  leading  to  alternative  outcomes.  In  this  final  report,  prepared  under 
contract  N0001A-84-C-0458  for  ONR,  new  results  in  decision  modeling  and  new 
results  obtained  using  these  models  are  described. 

The  Navy  needs  a  predictive  mathematical  theory  of  decisionmaking  In 
dynamic,  multiperson  decision  situations  to  better  understand  command  and 
control  In  Naval  conflict.  Models  representing  both  friendly  and  hostile 
forces,  both  sides  distributed  over  the  sea  with  limited  communications  among 
participants  on  each  side,  must  ultimately  be  treated  by  formal  mathematical 
theory  to  provide  a  sound  basis  for  future  system  design.  The  theory  should 
combine  results  from  the  mathematics  of  multlperson  decisionmaking  with 
results  from  behavioral  theories  of  human  decisionmaking  under  uncertainty. 
This  novel  combination  of  approaches  would  provide  valuable  qualitative  and 
quantitative  insights  Into  the  behavior  of  distributed  decisionmakers  In  com¬ 
plex  situations  of  interest  to  the  Navy.  The  work  reported  here  is  a  contri¬ 
bution  towards  the  multidisciplinary  theoretical  extension  needed,  drawing  on 
existing  prescriptive  mathematical  approaches  to  multiperson  decisionmaking 
and  on  descriptive  approaches  to  human  behavior  modeling  reported  in  the  beha¬ 
vioral  decision  theory  literature. 
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Macheoatlcal  game  theory  studies  decisionmaking  In  problems  with  multiple 
decisionmakers.  Game  theoretic  results  characterize  decision  strategies  and 
outcomes  which  are  rational  in  the  context  of  specific  quantitative  behavioral 
norms.  Hence,  the  purpose  of  game  theory  is  to  prescribe  "rational  behavior" 
in  a  multiperson  decision  situation.  However,  game  theory  has  several  short¬ 
comings  in  providing  a  predictive  theory  of  multiperson  decisionmaking: 

1.  It  is  based  on  the  assumption  that  all  decisions  will  be 
strictly  rational  decisions,  allowing  for  no  deviations 
from  rationality. 

2.  Rational  behavior  is  defined  in  terms  of  Von  Neumann's 
maximum  expected  utility  paradigm  [1]. 

3.  It  assumes  that  every  decisionmaker  has  a  common  represen¬ 
tation  of  the  overall  decision  problem  (the  rules  of  the 
game  and  the  game  parameters  are  common  knowledge  [2]). 

4.  It  assumes  that  players  are  fully-committed  to  select 
future  decisions  in  accordance  %rlth  aprlorl-selected 
strategies.  It  does  not  permit  adaptive  decisionmaking 
behavior. 

5.  It  assumes  that  processing  of  information  by  the  decision¬ 
makers  will  be  done  optimally,  in  a  Bayesian  framework  [3]. 

6.  It  does  not  explicitly  consider  human  limitations  and  beha¬ 
vioral  trends  in  information  processing,  option  evaluation, 
and  action  selection. 

Empirical  research  on  human  decisionmaking  [4], [5],  has  established  that  all 
of  the  above  assumptions  are  systematically  violated  in  specific  situations. 
This  work  has  led  to  several  extensions  of  the  basic  framework  of  game  theory. 
For  example,  the  recent  research  in  perfect,  proper  and  sequential  equilibria 
in  nonzero  sum  games  [6]-[8]  is  concerned  with  defining  rational  behavior  in 
a  way  tfhich  takes  into  account  the  possibility  of  "irrational"  actions.  The 
work  of  Aumann  and  Maschler  [9],  Ho  (10],  and  others  [II]  studies  games  where 
the  assumption  of  prior  committment  to  aprlori  strategies  was  relaxed.  The 


work  of  Harsanyl  [12], (13],  and  others  ([15]-*[16])  O”  games  of  Incomplete  Is 
aimed  at  relaxing  the  assumption  that  every  decisionmaker  has  a  common  repre¬ 
sentation  of  the  overall  problem.  The  work  of  Kadane  and  Latkey  [17], [18]  and 
Wilson  [19]  advocates  viewing  the  multiperson  decision  problem  as  a  set  of 
subjective  single  person  decision  problems,  where  ^ach  participant  models  the 
expected  actions  of  the  other  decisionmakers  by  subjective  probabilities. 

The  purpose  of  our  research  effort  was  to  further  develop  the  mathemat¬ 
ical  theories  of  multiperson  decisionmaking  by  explicitly  considering  multi¬ 
person  decision  models  which  Incorporate  human  limitations  and  behavioral 
trends  In  Information  processing,  option  evaluation,  and  action  selection. 

It  Is  our  opinion  that  a  predictive,  quantitative  theory  of  multiperson  deci¬ 
sionmaking  must  adopt  the  viewpoint  of  a  person  evaluating  his  decision  alter¬ 
natives  In  a  real-time,  dynamically-evolving  situation.  Thus,  at  any  time,  a 
human's  decision  problem  separates  Into  three  stages: 

1.  How  he  Interprets  the  Information  he's  observed  In  the  past, 

2.  How  he  ranks  his  possible  decisions  at  the  present  time,  and 

3.  How  he  selects  a  decision  based  on  these  rankings. 

These  stages  correspond  to  the  last  three  stages  In  the  SHOR  paradigm  for 
human  decisionmaking  discussed  In  Wohl  [20].  The  SHOR  paradigm  Is  Illustrated 
Fig.  1.  Using  this  paradigm,  we  were  able  to  organize  the  classes  of  human 
behavioral  traits  which  should  be  considered  In  the  multiperson  decision 
models.  In  Che  first  stage,  we  had  to  consider  biases  and  limitations  in  the 
interpretation  and  combination  of  information  obtained  through  observations  of 
the  decision  problem.  In  the  second  stage,  we  had  to  consider  how  individual 
decislomakers  would  evaluate  their  choices;  behavioral  decision  theory  has 
proposed  a  number  of  competing  axiomatic  models  (e.g.,  [21])  for  this  process. 
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In  the  last  stage,  we  had  to  consider  how  actions  were  selected  based  on  the.tr 
evaluation  (e.g.,  [22], [23]}. 

Starting  fron  this  premise,  we  began  our  research  by  reviewing  available 
results  on  human  behavior  In  Information  processing,  option  evaluation  and 
response  selection.  A  brief  summary  of  some  of  these  results  Is  included  In 
Appendix  A.  Essentially,  the  literature  contains  ample  evidence  that  one  of 
the  fundamental  atisumptlons  of  game  theory,  defining  rational  behavior  In 
terms  of  maximization  of  expected  utility,  is  systematically  violated  (see 
Machlna's  survey  [24]).  Thus,  many  alternative  formulations  for  prescribing 
rational  behavior  or  describing  human  behavior  have  been  proposed  ([4], [5]). 
Researchers  such  as  Kadane  and  Larkey  [17]  have  gone  as  far  as  suggesting 
that  a  prescriptive  theory  for  human  decisionmaking  In  multiperson  decision 
problems  should  adopt  a  single  decisionmaker  perspective,  where  the  actions 
of  other  decisionmakers  are  modeled  by  a  subjective  probability  ovtr  the 
possible  set  of  actions.  There  has  been  considerable  debate  on  this  approach, 
centering  on  whether  the  subjective  probability  should  depend  on  the  current 
choice  of  action  of  the  decisionmaker,  and  on  how  such  subjective  probabil¬ 
ities  are  computed  [25]-[28]). 

The  next  step  In  our  research  addressed  the  key  philosophical  question 
concerning  any  theory  of  multiperson  declsicumaking,  namely:  What  informa¬ 
tion  does  each  decisionmaker  have  concerning  the  behavioral  characteristics 
of  the  other  decisionmakers?  To  illustrate  the  importance  of  this  question, 
consider  the  game-theoretic  model.  In  game  theory,  this  information  is  part 
of  the  rules  of  the  game.  Hence,  this  Information  is  represented  as  common 
knowledge.  Specifically,  the  utility  function  of  each  decisionmaker  is  known 
to  every  other  decisionmaker.  Using  Harsanyi's  theory  of  games  of  incomplete 


information  [12],  this  assumption  can  be  relaxed  so  that  an  Individual's 
utility  function  Is  known  only  probabilistically  to  other  decisionmakers. 
However,  this  probability  distribution  Is  again  common  knowledge;  this  means, 
for  example,  that  decisionmaker  1  has  perfect  knowledge  of  how  decisionmaker 
2  models  decisionmaker  1'e  behavior.  Thus,  in  Harsanyl's  theory  of  games  uf 
Incomplete  Information,  the  assumption  of  common  knowledge  has  been  moded  one 
level  higher,  to  common  knowledge  of  a  probability  distribution  rather  than  of 
a  specific  value.  However,  this  assumption  still  Implies  that  each  decision¬ 
maker  knows  very  well  the  thinking  process  of  other  decisionmakers. 

Given  the  multiple  possibilities  for  models  of  human  decisionmaking 
described  In  Appendix  A,  we  felt  that  the  specific  parameters  of  an  Individual 
decisionmaker  would  be  known  primarily  to  himself,  and  not  to  the  other  decl- 
slotimakers.  There  are  four  possible  classes  of  approaches  for  mathematically 
modeling  the  information  which  each  decisionmaker  has  concerning  the  beha¬ 
vioral  characteristics  of  the  other  decisionmakers.  These  are: 

1 .  Common  knowledge , 

2.  Private  knowledge  with  Imperfect  Information, 

3.  Private  knowledge  with  Incomplete  Information,  and 

4.  Secret  knowledge. 

Modeling  behavioral  characteristics  Information  as  common  knowledge  assumes 
that  each  decisionmaker  knows  every  other  decisionmaker's  behavioral  charac¬ 
teristics,  and  this  information  is  comiaon  knowledge  In  the  sense  of  [2]. 
Modeling  the  Information  as  private  knowledge  with  imperfect  information  means 
that  there  Is  a  joint  probability  distribution  over  all  the  decisionmakers' 
possible  behavioral  characteristics.  This  probability  distribution  is  itself 
common  knowledge  among  decisionmakers;  in  addition,  each  decisionmaker  is 
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provided  with  partial  information  concerning  the  behavioral  characteristics 
of  the  other  decisionmakers. 

Modeling  the  information  as  private  knowledge  with  Incomplete  information 
is  similar,  except  that  there  Is  no  overall  joint  probability  distribution 
which  is  common  knowledge;  in  this  case,  each  decisionmaker  must  subjectively 
construct  this  probability  distribution.  Modeling  this  Information  as  secret 
information  means  that  each  decisionmaker  has  a  subjective  model  of  every 
other  decisionmaker,  and  any  differences  among  the  models  held  by  different 
decisionmakers  is  secret  knowledge. 

From  a  mathematical  perspective,  modeling  human  characteristics  of  deci¬ 
sionmakers  as  common  knowledge  is  the  approach  which  is  most  akin  to  game 
theory.  Essentially,  human  characteristics  of  decisionmakers  would  be  incor¬ 
porated  into  the  rules  of  the  game.  For  example,  risk-averse  behavior  [24] 
would  be  represented  as  a  factor  in  a  decisionmaker's  utility;  similarly,  any 
biases  in  information  processing  would  be  Incorporated  in  the  rules  for  up¬ 
dating  of  probabilities.  Such  modification  to  the  expected  utility  paradigm 
would  result  in  the  violation  of  several  Important  game  theory  results.  For 
example.  Von  Neumann's  normalization  principle  [1],  which  states  that  a  game 
is  extensive  form  can  be  reduced  to  an  equivalent  game  in  normal  form,  may  not 
be  applicable  if  probabilities  do  not  evolve  according  to  Bayes'  rule  or  if 
strategies  are  not  selected  according  to  the  expected  utility  model. 

Although  convenient  frcm  a  mathematical  perspective,  the  common  knowledge 
approach  is  the  least  satisfactory  frcm  a  modeling  perspective,  because  it 
assumes  that  each  decisionmaker  has  a  very  accurate  model  of  the  other  deci¬ 
sionmakers.  The  private  knowledge  with  Imperfect  information  approach  makes 
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weaker  assumptions  concerning  the  knowledge  provided  to  decisionmakers  cor- 
cernlng  the  knowledge  provided  to  decisionmakers  concerning  the  behavior  ol 
other  decisionmakers.  However,  the  Imperfect  knowledge  Is  modeled  by  a  proba¬ 
bility  distribution  which  Is  common  knowledge.  It  Is  not  clear  that  in  multi¬ 
person  decision  situations,  such  a  state  of  common  knowledge  can  be  achieved 
without  an  extensive  cooperative  bargaining  session  to  agree  on  this  proba¬ 
bility.  For  many  military  situations  of  Interest,  this  aay  not  be  possible. 

From  a  practical  perspective,  the  last  two  approaches  capture  the  sub¬ 
jective  nature  of  human  decisionmaking  best.  In  the  private  knowledge  with 
Incomplete  Information  spproach,  each  decisionmaker  can  subjectively  estimate 
a  probability  distribution  for  the  decisionmaking  characteristics  of  every 
other  decisionmaker.  However,  these  estimates  are  subjective  estimates,  and 
there  Is  no  reason  that  they  must  be  consistent  across  decisionmakers.  In  the 
secret  knowledge  approach,  each  decisionmaker  models  the  human  characteristics 
of  other  decisionmakers  as  values  rather  than  probability  distributions  and 
assumes  that  these  values  are  correct.  Again,  the  fact  that  these  values  may 
be  Incorrect  is  secret  knowledge  to  each  decisionmaker.  In  both  of  these 
appioaches,  each  decisionmaker  has  a  subjective  model  of  how  he  and  other 
decisionmakers  will  make  decisions.  However,  these  models  need  not  be  consis¬ 
tent  acrcss  decisionmakers.  This  leads  to  a  number  of  Interesting  questions 
which  we  iressed  In  our  research. 

1.  I'.o  qualitative  results  characterizing  outcomes  of  multiperson 
tiexision  situations  change:  due  to  the  subjective  model  dif¬ 
ferences  among  decisionmakers?. 

2.  How  do  decisionmakers  interpret  information  they  receive 
from  other  decisionmakers? 

3.  How  do  they  Incorporate  this  Information  to  form  their 
decisions? 


Do  the  decisionmakers  realize  during  the  play  of  the  game 
that  their  decision  models  are  inconsistent? 

5*  Do  those  Inconsistencies  prevent  the  decisionmakers  from 
reaching  a  desirable  outcome? 

6«  When  Inconsistencies  are  detected,  how  do  the  decisionmakers 
modify  their  subjective  models? 

In  order  to  study  these  questions  in  a  specific  context,  we  considered 
two  classes  of  multiperson  decisionmaking  problems: 

1.  Consensus  Problems,  and 

2«  iVo-Person  nonzero  sum  games  of  incomplete  Information, 

In  consensus  problems,  multiple  decisionmakers  with  private  Information  and 
a  common  goal  are  trying  to  each  a  consensus  decision  by  proposing  tentative 
decisions  recursively  among  themselves.  Consensus  problems  are  a  simple  class 
of  cooperative  decision  problems  emphasizing  the  implicit  information  transfer 
among  decisionmakers  through  the  choice  of  decvslons  (signaling).  Under 
simple  communication  conditions,  the  results  cf  [29]~[3A]  establish  that.  In 
the  absence  of  sutjectlve  model  differences,  a  consensus  Is  always  reached. 
However,  differences  in  subjective  models  among  declslonmakexs  may  lead  to 
misunderstandings  In  the  signaling  processes,  thereby  preventing  the  decision- 
makers  from  reaching  a  consensus. 

Our  first  Investigation  of  these  problems  is  summarized  in  the  paper  In 
Appendix  B,  In  this  paper,  we  show  that,  for  a  specific  consensus  problem  of 
estimating  the  probability  of  an  event,  when  the  Individual  differences  among 
decisionmakers  are  secret  knowledge,  the  consensus  process  can  reach  a  state 
of  contradiction.  In  this  state,  the  signals  from  one  decisionmaker  cannot 
be  Interpreted  by  another  decisionmaker  due  to  the  secret  difference  between 
their  models.  Thus,  the  secret  Information  that  the  subjective  models  were 


different  becomes  common  knowledge.  In  Appendix  C,  we  show  thst  this  phenom¬ 
enon  Is  typical  of  general  consensus  problems  when  differences  In  subjective 
decisionmakers  are  secret  knowledge.  Furthermore,  we  establish  that  there  are 
only  two  classes  of  qualltltlve  outcomes  possible:  a  consensus  Is  reached, 
so  the  differences  In  subjective  models  remain  secret  knowledge,  or  a  contra¬ 
diction  Is  reached,  so  the  differences  In  subjective  models  become  common 
knowledge. 

One  question  which  was  not  addressed  In  the  results  in  Appendices  B  and 
C  concerned  the  likelihood  of  the  two  classes  of  outcomes.  For  specific 
problems,  how  likely  wss  a  contradictory  outcome?  In  Appendix  D,  we  develop 
a  number  of  results  which  provide  answers  to  this  question.  Specifically,  we 
derive  conditions,  depending  on  the  type  of  consensus  problem,  which  charac¬ 
terize  whether  the  set  of  subjective  models  which  result  in  contradictions  Is 
a  dense  set  In  the  set  of  possible  models,  using  a  specific  topology.  We  also 
derive  conditions  which  characterize  when  the  consensus  process  is  robust  to 
small  differences  in  subjective  models. 

In  the  paper  In  Appendix  0,  we  also  study  the  consensus  problem  when  the 
differences  among  subjective  models  are  represented  as  prlvste  Information 
with  either  Incomplete  or  Imperfect  knowledge.  That  Is,  there  Is  a  second- 
order  probability  model  which  describes  each  decisionmaker's  subjective 
beliefs  concerning  the  decision  model  of  the  other  decisionmakers.  The 
results  in  Appendix  D  establish  that  the  two  classes  of  qualitative  outcomes 
described  previously  can  occur.  In  addition,  there  is  yet  a  third  class  of 
outcomes,  which  can  be  described  as  follows.  In  these  outcomes,  the  consensus 
process  reaches  a  state  where  each  decisionmaker  cannot  learn  any  additional 


Infonation  froa  the  tentettve  dccielone  cooaunlcaeed  by  other  decleionaakers. 
However,  there  Is  a  significant  difference  In  Information  among  decision¬ 
makers,  so  that  a  consensus  la  not  reached. 

The  result  In  Appendix  D  la  very  surprising.  Coosld*r  •  consensus  pro¬ 
cess  where  two  decisionmakers  have  exactly  the  same  decision  model.  According 
to  the  results  of  (32]..  a  consensus  will  be  reached  eventually.  Hotrever, 
assume  now  that  both  decisionmakers  have  exactly  the  acme  decision  model,  but 
they  do  not  know  It  for  certain!  The  results  in  Appendix  D  show  that,  in  this 
situation,  this  lack  of  certainty  can  lead  the  consensus  piocess  to  a  stalled 
state,  from  which  a  consensus  cannot  be  reached.  In  essence,  the  lack  of 
certainty  prevents  the  consensus  process  from  transmitting  enough  information 
to  achieve  a  consensus  outcome. 

This  result  has  Important  Implications  for  more  general  clasaes  of  multi¬ 
person  decision  problems,  such  as  cooperative  bargaining  problems.  Loosely 
Interpreted,  It  states  that  the  preaence  of  uncertainty  concerning  behavioral 
characteristics  of  the  bargaining  decisionmakers  can  lead  to  a  stalemate  In 
the  bargaining  process.  In  practical  situations,  this  stalemate  can  be  broken 
by  changing  the  nature  of  the  bargaining  process,  such  as  bringing  in  an 
arbitrator. 

In  the  area  of  nonzero  sum  games  of  Incomplete  Information,  we  considered 
a  simple  class  of  two-person  symmetric  games.  We  developed  an  axiomatic  for¬ 
mulation  describing  Che  subjective  decision  model  approach  discussed  above. 
Based  on  this  formulation,  we  studied  the  properties  of  Che  rational  strate¬ 
gies  and  outcomes  of  both  the  static  and  infinitely-repeated  versions  of  the 
game.  The  results  in  Appendix  E  show  that,  under  the  assumption  that  each 


dcciaionaakcr  is  subjectively  rational  according  to  his  ovn  internal  aodel* 
in  the  static  case,  the  existence  of  differences  between  subjective  aodels 
bccoaes  conon  knowledge.  However,  in  the  Infinitely-repeated  g*ae,  these 
differences  are  aaellorated  during  the  play  of  the  gane,  so  that  there  exist 
core  equillbriua  strategies  which  are  rational  from  each  decisionmaker's 
perspective.  In  Appendix  E,  we  show  that  whether  the  subjective  ditierences 
become  common  knowledge  or  cannot  depend  on  the  specific  bargaining  strategy 
used  for  selecting  among  potential  core  equilibrium  outcomes. 

In  sum,  our  research  has  investigated  two  classes  of  mathematical  models 
of  multiperson  decisionmaking  in  dynamic  situations  which  focus  on  the  sub jec- ' 
tive  nature  of  human  behavioral  characteristics  in  information  processing, 
option  evaluation  and  action  selection.  Our  mathematical  formulation  is  sim¬ 
ilar  in  spirit  to  Kndane  and  Larkey's  [17],  In  that  individual  rationality  is 
defined  ih  terms  of  subjective  aodels.  However,  we  do  not  propose  to  replace 
the  game- theoretic  models  in  these  subjective  models  by  decision  theory  models 
using  subjective  probabilities  pf  action  for  other  decisionmakers.  Instead, 
we  have  developed  a  mathematical  formulation  which  allows  for  differences  in 
subjective  game  theory  models  among  decisionmakers.  Our  research  focused  on 
studying  the  qualitative  Impllcatlcns  of  these  differences  in  cooperative  and 
noncoop-^rative  paradigms.  The  results  developed  in  the  appendices  to  this 
report  shed  considerable  light  on  these  implications  in  dynamic  declslcnmaklng 
under  uncertainty. 

Nevertheless,  many  technical  problems  remain  to  be  addressed.  .Perhaps 
the  key  question  remaining  unresolved  is  how  to  effect  the  resolution  of  model 

♦For  an  interesting  discussion  of  the  application  of  internal  models  to  repre¬ 
senting  Navy  Commard  and  Control  decisionmaking,  see  Athans  [35]. 
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differences  once  they  are  discovered.  The  approach  proposed  In  Appendix  D 
requires  Chat  an  inference  problen  be  started  at  that  tine  to  properly  iden¬ 
tify  the  sodel  differences.  This  approach  does  not  indicate  how  to  resolve 
these  differences  to  achieve  a  desirable  qualitative  outcoae,  either  through 
a  coenaon  calibration  process  or  a  bargaining  process. 

The  mibjective  gaae  fraacwork  which  was  used  throug'.iout  our  research 
provides  a  foundation  for  the  developnent  of  a  quantitative  predictive  theory 
of  sultiperson  declslooBaklng.  In  our  opinion,  the  next  aajor  step  is  the 
developswat  of  specific  behavioral  theories  and  aathematical  aodels  of  these 
theories  for  busMn  decislonnaking  in  dynamic  aultlperson  decision  situations* 
As  seen  in  the  survey  in  Appendix  A,  the  state  of  the  research  in  Individual 
decisiori  theory  has  focused  on  developing  alCernatives  to  Che  expected  utility 
paradigm  [1|;  Che  numerous  competing  alternatives  must  be  narrowed  down  and 
shown  to  be  superior  to  the  expected  utility  paradigm.  Furthermore,  addl- 
tliinal  research  Is  needed  to  focus  on  the  ssiltlperson  aspects  of  behavioral 
th'.-orlas  of  Information  processing  and  option  selection.  Based  on  these 
results,  specific  quantitative  models  of  mulclperson  decisionmaking  can  be 
developed  and  evaluated  in  empirical  research.  These  research  directions  are 
being  followed  in  many  research  programs,  so  Chat  progress  towards  the  devel¬ 
opment  of  a  quantitative  predictive  theory  of  mulclperson  decisionmaking  will 


continue 
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APPENDIX  A 

AN  OVERVIEW  OF  RESULTS  IN  HUMAN  DECISIONMAKING 

A.l  INTRODUCTION 

In  this  appendix,  we  overview  some  of  the  principal  reaulcs  available  In 
the  literature  concemlnR  eoplrlcal  studies  of  how  huaans  sake  decisions.  We 
have  organized  these  results  into  two  sections:  Human  Inference,  dealing  with 
the  problem  of  situation  assessment  and  hypothesis  evaluations,  and  utility 
theory,  dealing  with  the  problems  of  option  evaluation  and  option  selection. 
For  a  more  detailed  review  of  these  disciplines  the  reader  is  referre*  to  the 
excellent  surveys  and  books  by  Rapoport  and  Wallsten  [1972],  Slovic,  Fishoff 
and  Lichtenstein  (1977],  Einhom  and  Hogarth  (1981],  Kahneman,  Slovic  and 
Tversky  (1982],  Schoemaker  (1982],  fishburn  [1982]  and  Machlna  [1983]. 

A. 2  HUMAN  BEHAVIOR  IN  SITUATION  ASSESSMENT 

Since  most  of  the  decision  tasks  involve  uncertainty,  considerable  effort 
has  been  spent  in  studying  how  people  formulate  and  change  their  opinions 
about  uncertain  hypotheses.  The  literature  in  the  area  of  subjective  proba'^ 
bllity  assessment  and  revision  of  opinion  shoys  two  different  approaches  to 
the  modeling  problem.  The  first  approach,  advanced  by  statisticians  and 
psychologists,  is  based  on  probability  theory  and  statistics  and  relies  on 
the  concept  of  a  "statistical  man"  -  cn  optimal,  Bayesian  inferer  (observer). 
Bayes'  rule  provides  a  normative  representation  of  how  a  decisionmaker  should 
revise  his  probability  estimates  on  the  basis  of  new  information.  The 
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descriptive  considerations  have  been  handled  primarily  by  adjusting  the 
functional  form  of  the  normative  model.  This  approach  led  to  the  study  of 
’‘conservatism"  -  a  suboptimal  human  behavior  that  produces  posterior  proba¬ 
bilities  nearer  to  the  prior  probabilities  than  those  specified  by  Bayes* 
rule  (Edwards  and  Phillips  [1964];  Peterson  and  Beach  (1967)).  The  second 
approach,  proposed  mainly  by  psychologists,  argues  that  the  human  is  a 
selective,  sequential  information  processor  with  limited  capacity  (Hogarth 
[1975]).  It  is  hypothesized  that  this  limited  information-processing  capacity 
leads  him  to  apply  simple  heuristics  and  cognitive  strategies  which  reduce 
the  complex  tasks  of  assessing  probabilities  and  predicting  values  to  simpler 
Judgmental  operations.  Much  of  the  work  on  this  Judgmental  heuristics  has 
been  performed  by  Tversky  and  Kahneman  [1971],  [1973],  [1974].  They  demon¬ 
strated  that  three  Judgw'ntal  heuristics  -  representativeness,  availability, 
adjustments  and  anchoring  -  v'*.termine  probabilistic  inferences  in  sutny  tasks. 
However,  these  findings  can  only  described  in  qualitative  terms  and,  as 
yet,  no  quantitative  descriptive  theory  based  on  the  heuristics  has  emerged. 

A. 2. I  Bayesian  Revision  of  Opinion 

Bayes*  rule  has  provided  much  of  the  Impetus  to  the  research  on 
normative-descriptive  modeling  of  Judgmental  processes.  A  basic  hypothesis 
is  that  opinions  (Judgments)  should  be  expressed  in  terms  of  subjective  proba¬ 
bilities  and  that  the  optimal  revision  of  such  opinions  must  be  accomplished 
via  Bayes*  rule.  A  considerable  number  of  studies,  involving  mainly  binomial 
(so  called  "bookbag  and  poker  chip"  games)  and  multinomial  tasks,  have  com¬ 
pared  subjects'  numerical  probability  assessments  to  those  predicted  by  Bayes* 
rule  (Edwards  and  Phillips  [1964];  Peterson  and  Beach  [1967],  Donnel  and 
Ouchrame  [1975]).  A  general,  but  by  no  means  universal,  conclusion  has  been 


that  the  eatlmates  were  aonotonlcally  related  to  those  specified  by  Bayes* 
rule,  but  were  conservative.  That  is,  the  posterior  probabilities  estimated 
by  subjects  were  nearer  to  the  prior  probabilities  than  those  obtained  via 
Bayes*  rule.  Several  explanations  are  offered  for  the  phenomena  of  conserva¬ 
tism.  It  is  believed  that  conservatism  is  due,  in  small  part,  to  procedural 
variables  (e.g.,  payoffs  and  incentives,  sample  size,  sequential  ordering  of 
the  data,  prior  probabilities,  etc.)  and,  in  large  part,  to  subjects*  mlsper^ 
caption  of  the  underlying  sampling  distributions,  mlsaggregatlon  of  the  data, 
or  simply  response  bias.  Misperception  is  generally  attributed  to  the  mis¬ 
match  between  aubjective  and  actual  (objective)  probability  dlatrlbutlons 
(Peterson,  Duchrame  and  Edwards  [1968];  Wheeler  and  Beach  [1968], 

Lichtenstein  and  Feeney  [1968]),  and  to  the  human  tendency  to  discount  Che 
importance  of  rare  events  when  they  occur  (Vlek  and  Van  der  Hejden  [1967]). 
Mlsaggregatlon  refers  to  the  nonoptimal  sequential  revision  of  subjective 
probabilities  and  has  been  advanced  as  the  major  source  of  conservatism 
primarily  by  Edwards  and  his  associates  (Edwarc:;  [1968],  Edwards,  Phillips, 
Hayes  and  Goodman  [1968]).  The  notion  of  response  bias  was  advanced  by 
Peterson  [1968]  and  is  related  Co  subjects*  unwillingness  to  use  extreme 
numbers  and  odds.  A  comprehensive  review  of  the  issue  of  conservatism  is 
provided  by  SI o vie  and  Lichtenstein  [1971],  Rapoport  and  Vallsten  [1972]  and 
Slovic,  Flshoff  and  Lichtenstein  [1977]. 

Along  different  lines,  Kahneraan  and  Tversky  [1973],  Tversky  and  kahneman 
[1974]  and  Grether  [1980]  among  others,  have  found  that  probability  updating 
underweighs  prior  information  and  overweighs  the  representativeness  of  the 
current  sample.  This  phenomena  is  similar  to  Tversky  and  Kahneman's  [1971] 
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law  of  small  numbers.  Tversky  and  Kahneman  {1983]  have  also  found  biases  In 
the  combination  of  evidence  from  independent  and  correlated  sources. 

In  an  attempt  to  overcome  the  descriptive  deficiencies  of  the  Bayesian 
model,  several  empirical  modifications  to  Bayes*  rule  have  been  offered. 

These  modifications  can  be  embedded  into  a  generalized  version  of  Bayes*  rule 
where  an  additional  term  is  multiplied  to  the  likelihood  of  each  hypothesis. 
This  term  is  called  a  disability  or  impediment  function  in  Edwards  and 
Phillips  [1964],  and  la  supposed  to  capture  the  suboptlmal  nature  of  human 
information  processing  (Snapper  and  Frybach  [1971]). 

Although  Bayesian  revision  of  opinion  can  be  studied  as  a  separate 
phenomena,  it  is  most  useful  when  interwoven  with  decisionmaking  and  action 
selection.  The  posterior  probabilities  of  various  hypotheses  (states  of 
nature)  can  be  used,  in  combination  with  information  about  payoffs  associated 
with  various  decisions  and  states  of  nature,  to  maximize  the  (subjective) 
expected  value,  the  (subjective)  expectrd  utility,  or  whatever  criterion  of 
optimality.  Human  performance  modeling  in  signal-detection  tasks  exemplifies 
this  approach.  The  task  of  the  subject  is  to  decide  whether  or  not  a  signal 
is  present  in  a  block  of  observations. 

Signal-detection  experiments  have  been  conducted  in  a  wide  variety  of 
contexts.  Examples  are  experiments  in  sound  localization  (Voelcker  [1961]), 
detection  of  movement  (Klnchela  and  Allen  [1969]),  speech  recognition  (Egan, 
et  al.  [1961]),  and  recognition  of  memorized  words  (Parks  [1966]).  A  com¬ 
prehensive  exposition  of  signal-detection  theory  and  psychophysics  has  been 
provided  by  Green  and  Swets  [1966],  and  a  fine  summary  of  the  theory  is  pro¬ 
vided  by  Sheridan  and  Parrel  [1974J. 


The  experiaental  results  show  that  the  human  performance  Is  monotonically 
related  to  those  predicted  by  the  model.  It  Is  possible  to  manipulate  subjec¬ 
tive  decision  thresholds  (criteria)  by  varying  prior  probabilities  and  pay¬ 
offs.  However,  the  amount  of  change  has  been  found  to  be  less  than  optimal. 
The  subjects  also  have  difficulty  aggregating  Information  across  a  sequence 
of  trials  (Swets  and  Green  11961])  -  a  tendency  similar  to  conservatism  In 
Bayesian  revision  of  opinion. 

An  additional  effect  in  human  Inference  which  cannot  be  represented  uring 
impediment  funtlons  is  the  asymmetry  between  effect-cause  Inferences  a^l 
cause-effect  Inferences  noted  In  AJsen  [1977],  Tversky  and  Kahneman  [1980a] 
and  Elnhorn  and  Hogarth  [1981].  The  empirical  evidence  Indicates  that  Infor¬ 
mation  which  receives  a  causal  interpretation  Is  weighed  more  heavily  In 
judgment  than  Information  that  is  diagnostic.  Tversky  and  Kahneman  [1980] 
correlate  their  results  with  previous  research  (Janls  [1972])^  (Jervis  [1975]) 
indicating  that  humans  overestimate  the  accuracy  of  uncertain  models  In  pre¬ 
dicting  behavior,  and,  when  confronted  with  evidence  concerning  the  errors  In 
their  models,  would  rather  find  a  plausible  explanation  than  revise  their 
models,  'ihese  results  are  particularly  relevant  In  taultlperson  decision¬ 
making,  where  the  Interactions  among  decisionmakers  force  each  decisionmaker 
to  develop  internal  models  of  the  other  decisionmakers. 

A. 2. 2  Judgmental  Heuristics  and  Biases 

Recent  research  on  probabilistic  judgments  has  focused  on  the  discovery 
and  description  of  heuristics,  or  simple  cognitive  strategies,  that  are  em¬ 
ployed  In  the  quantification  of  uncertainty.  Much  of  this  work  has  been  per¬ 
formed  by  Tversky  and  Kahneman  [1971],  [1973],  [1974].  Their  research  centers 


on  the  deteralnatlon  of  how  people  evaluate  uncertainty  rather  than  how  well 
they  evaluate  It.  Tversky  and  Kahneman  demonstrated,  qualitatively,  that 
three  Judgmental  heuristics  -  representativeness,  availability,  adjustment  and 
anchoring  -  determine  probabilistic  judgments  In  a  variety  of  tasks.  These 
iqualltatlve  findings  are  potentially  useful  In  the  development  of  a  (as  yet 
elusive)  quantitative,  descriptive  theory  of  judgment. 

1 .  REPRESENTATIVENESS  HYPOTHESES 

Tversky  and  Kahneman  [1972]  hypothesized  thnt  people  evaluate  the  proba¬ 
bility  of  an  event  on  the  basis  of  the  degree  of  slmilaiity  between  the  event 
and  the  evidence  they  have  examined.  If  the  degree  of  similarity  Is  high, 
then  the  probability  of  the  event  Is  judged  to  be  high.  It  was  demonstrated 
that  the  representativeness  heuristic  can  explain  people's  Intuitive  predic¬ 
tions  that  were  at  variance  with  the  normative  judgments.  This  was  accom¬ 
plished  by  showing  the  Insensitivity  of  the  representativeness  heuriS;tic  to 
several  normatively  Important  factors  of  judgment,  viz.,  the  representative¬ 
ness  heuristic  Is  liable  to  be  used  when  the  general  properties  of  events  are 
emphasized. 

11.  AVAIIABILITY  HEURISTIC 

With  this  heuristic,  people  evaluate  the  probability  of  an  event  on  the 
basis  of  the  ease  with  which  instances  or  occurances  can  be  recalled  or  imag¬ 
ined.  Availability  is  a  valid  cue  for  the  assessment  of  probability  because, 
in  general.  Instances  of  more  frequent  events  are  recalled  more  easily  than 
the  Instances  of  less  frequent  events.  However,  availability  Is  also  affected 
by  other  factors  unrelated  to  probability.  Consequently,  availability  heuris¬ 
tic  results  have  been  used  in  systematic  biases,  some  of  which  follow: 


(1)  Biases  due  to  retrievablllty  of  Instances:  an  event  whose 
Instances  are  easily  recalled  will  appear  more  frequently 
than  an  event  of  equal  probability  (or  frequently)  whose 
Instances  are  less  easily  recalled. 

(2)  Biases  due  to  the  effectiveness  of  a  search  set:  In  tasks 
requiring  the  estimates  of  the  relative  frequencies  of 
words,  the  availability  heuristic  leads  to  a  judgment  that 
the  frequency  of  occurrence  of  abstract  words  (e.g.,  love 
in  love  stories)  is  touch  higher  than  the  concrete  words 
(e.g.,  door). 

(3)  Biases  of  Imaginabllity:  in  tasks  in  which  one  must 
assess  the  frequency  of  an  event  whose  instances  are  not 
stored  in  memory,  one  may  generate  the  instances  according 
to  an  algorithm  or  a  riile.  The  ease'  with  which  one  cun 
generate  Instances  forms  the  basis  for  probability  or  fre¬ 
quency  assessoient .  Depending  on  the  nature  of  the  rule, 
this  mode  of  probability  assessment  may  lead  to  serious 
biases. 

(4)  Illusory  correlation:  The  probability  of  how  frequently 
two  events  co-occur  is  related  to  the  associative  bond 
between  them.  Strong  associates  will  be  judged  to  have 
occured  frequently  together. 

It  is  stated  that  the  availability  heuristic  Is  vsed  when  events  are  thought 
of  in  terms  of  specific  instances. 

lli.  ADJUSTMENT  AND  ANCHORING 

With  this  heuristic,  an  Initial  value  or  anchor  is  used  as  a  first 
approximation  to  the  judgment.  The  initial  value  is  then  adjusted  according 
to  the  information  provided.  Typically,  these  adjustments  are  imperfect  and 
insufficient.  Even  payoffs  for  accuracy  did  not  reduce  this  effect.  It  was 
demonstrated  that  anchoring  could  explain  people’s  judgment  of  the  probabili¬ 
ties  of  conjunctive  and  disjunctive  events  and  their  assessment  of  the  vari¬ 
ability  of  prcbablllty  distributions. 

Studies  of  decisions  under  risk  indicate  that  people  tend  to  overestimate 
the  probability  of  conjunctive  events  and  underestimate  the  probability  of 
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disjunctive  events.  With  regard  to  the  variability  of  probability  distribu¬ 
tions,  several  Investigators  found  that  subjects  state  more  narrow  confidence 
intervals  than  are  Justified  by  the  evidence  presented  to  them.  Edwards 
[1975]  noted  that  this  peculiarity  is  dependent  entirely  on  the  format  of 
questions. 

A  comprehensive  review  of  heuristics  in  probabilistic  J’^lgment  has  been 
provided. by  Slovir,  Flshoff  and  Lichtenstein  [1977].  Although  the  evidence 
suggests  that  heuristics  are  employed  in  the  assessment  of  uncertainty,  the 
specific  heuristic  selected,  the  way  it  is  used,  and  the  quality  of  Judgment 
it  provides,  are  all  highly  problem  dependent.  Therefore,  heuristics  may  be 
thought  of  as  explanatory  psycholc  ,ical  processes  of  human  probabilistic 
Judgment  and  cannot  be  regarded  as  a  general  theory  of  judgment. 

A. 3  EMPIRICAL  RESEARCH  IN  UTILITY  THEORY 

As  stated  in  Machlna's  survey  [1983],  the  theoretical  underpinnings  of 
single  person  decsion  theory  under  uncertainty  are  based  on  the  expected 
utility  hypothesis  of  individual  behavior.  This  hypothesis  essentially  states 
that,  when  faced  with  alternative  risky  prospects  over  a  set  of  outcomes,  a 
rational  decisionmaker  will  always  choose  a  prospect  which  yields  the  highest 
expectation  of  some  utility  function  defined  over  the  set  of  outcomes.  This 
utility  function,  often  called  a  Von  Neumann-Mcrgenstern  utility  function, 
represents  the  preference  of  individuals  over  the  different  outcomes. 

As  noted  in  Fishburn  [1982],  the  existence  of  Von  Neumann-Morgenstern 
utility  function  and  the  validity  of  the  expected  utility  hypothesis  follow 
from  some  variations  of  the  following  three  axioms: 


(1)  Transitivity  of  preferences  over  prospects;  i>e.,  if 

pro  pect  A  is  weakly  preferred  to  prospect  3,  and  prospect 
B  is  weakly  preferred  to  prospect  C,  then  prospect  A  is 
weakly  preferred  to  prospect  C. 

(2)  Continuity  of  preferences  are  prospects;  i.e.,  if  two 
sequences  of  prospects  A„,  are  such  that  A,,  is  weakly 
preferred  to  and  A^,  converge  to  A,  B  respectively 
(in  the  topology  of  weak  convergence),  then  A  is  weakly 
preferred  to  B. 

(3)  Independence  of  preferences  over  common  alternatives; 
l.e.,  if  A  is  weakly  preferred  to  B,  then,  ior  any  C,  and 
any  0<  o<l,  aA+  (1-a)  C  is  weakly  preferred  to  oB+(l-a)C. 

These  axioms  appear  to  be  simple  conditions  «rhlch  the  preferences  of  any 
rational  decisionmaker  should  satisfy.  Hence,  empirical  results  concerhing 
the  validity  of  these  axioms  have  been  limited.  As  Machine  states:  njnfortu- 
nately,  though  no  doubt  due  In  part  to  the  widely  held  belief  in  the  Inherent 
''rationality*'  of  the  expected  utility  axioms  and  in.  part  to  the  tremendous 
success  of  the  theoretical  developments  during  this  period,  the  last  few 
decades  have  seen  nowhere  near  the  amount  of  empirical  estimation  and  testing 
by  economists  that  such  a  widely  used  model  of  behavior  ought  to  have  re¬ 
ceived."  In  this  section,  we  will  overview  some  of  the  available  results  on 
empirical  research  in  utility  theory. 

,  One  of  the  first  empirical  studies  concerned  the  risk  behavior  of  Indi¬ 
viduals,  and  their  desire  both  to  participate  in  high-stakes  risk-seeking 
lotteries,  and  risk-averse  Insurance  purchases.  Friedman  and  Savage  [1948] 
postulated  that  the  utility  function  should  be  a  function  that  is  concave, 
"locally  risk  averse",  about  low  wealth  levels,  and  convex  (risk  seeking)  at 
high  wealth  levels.  Hence,  an  individual  whose  current  wealth  position  was 
near  the  Inflection  point  would  indeed  purchase  both  Insurance  against  losses 
and  lottery  tickets  offering  a  small  chance  of  large  gains.  This  utility 
function  implies  two  other  commonly  observed  aspects  of  individual  preferences 
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over  uncertain  prospects.  First,  Individuals  prefer  increases  in  risk  in  the 
upper  tails  of  already  random  wealth  distributions  over  risk  increases  in  the 
lower  tails  of  such  distributions.  Second,  individuals  prefer  distributions 
with  large  right  taHs  over  large  left  tails  (Kraus  and  Litzenberger  (1976J, 
Scotc  .>nd  Horvath  {1980]). 

There  is  evidence  to  suggest  that  the  Friedman-Savage  characterization 
of  the  utility  functions  (risk-averse  over  biases,  risk-seeking  bver  gains) 
is  incomplete.  Aabng  others,  Hershey  and  Schoemaker  (1980a],  (1980b]  and 
Kahneman  and  Tversky  (1979]  have  found  both  risk-averse  and  risk-seeking  behav¬ 
ior  In  loss  situations.  Kahneman  and  Tversky  (1979]  attribute  this  observed 
behavior  to  nonllnearltles  in  the  probabilities.  Hershey  and  Schoemaker 
(1980a]  also  found  that  problem  representations  affected  the  decisionmaking 
behavior.  The  differences  in  behavior  introduced  by  problem  representation 
were  observed  in  many  other  studies  (Slovlc  (1969a],  Schoemaker  and  Kunreuther 
(1979],  Tversky  and  Kahneman  (1981],  Kahneman  and  Tversky  (1982])  and  led  to 
the  theory  of  "framing,"  where  the  mental  point  for  defining  trttat  is  a  gain 
versus  what  is  a  loss  depends  on  the  specific  wording  of  the  decision  problem. 

There  have  been  a  number  of  empirical  studies  to  test  the  validity  of  the 
axioms  of  expected  utility  theory.  The  most  critical  of  these  axioms  is  the 
Independence  axiom.  As  noted  in  Fishburn  (1982]  the  Independence  axiom  is ' the 
primary  normative  assertion  of  expected  utility  theory,  because  it  restricts 
the  expected  utility  to  be  a  linear  functional  of  the  probability  distribution 
over  the  set  of  outcomes.  However,  examples  such  as  Allais'  paradox  (19b3j 
have  produced  evidence  that  this  axiom  is  violated  often  (Allais  and  Hagen 
(  1979),  Half  fa  (1968),  Slovlc  and  Tversky  (1974),  Kahneman  and  Tversky  (1979), 
MacCrimmon  and  larsson  (1979)). 
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The  assunpcion  of  transitive  preferences  has  also  been  tested  empiri¬ 
cally.  Early  studies  in  utility  theory  (Arrow  [1951])  indicated  that  in  many 
choice  situations,  preferences  were  not  transitive.  Empirical  research  by 
Edwards  [1954],  Weinstein  [1968]  and  Tversky  [1969]  have  produced  examples 
of  intransitive  preferences  among  alternatives.  In  subsequent  research, 
Lichtenstein  and  Slovic  [1973]  have  found  that,  when  faced  with  a  direct 
choice  of  prospects  A  and  B,  decisionmakers  prefer  A  to  B  even  though  they 
assign  a  higher  certainty  equivalent  value  to  B  than  to  A. 

Another  systeisatlc  violation  of  the  expected  utility  hypothesis  arises  In 
the  context  of  choice.  Specifically,  the  expected  utility  hypothesis  predicts 
that  the  prospect  which  maximizes  expected  utility  will  always  be  selected. 
However,  when  repeatedly  confronted  with  the  same  (>air  of  prospects,  worded  in 
the  same  manner,  individuals  will  not  always  make  the  same  choice.  The  early 
results  of  Hosteller  and  Nogee  [1951]  established  that  the  individual's  choice 
probabilities  were  continuous,  monotonolc  functions  of  the  differences  in  the 
expected  utilities  of  the  prospects.  Explicit  models  of  randomized  choice 
have  been  proposed  by  Luce  [1959]  and  Flshbum  [1976],  [1978]  among  others. 

At  a  more  fundamental  level,  the  expected  utility  hypothesla  postulates 
the  existence  of  probabilities  which  obey  the  standard  calculus  (Savage 
[1954]}.  However,  Ellsberg's  paradox  (Ellsberg  [1961])  demonstrates  that 
rational  decisonmakers  may  weigh  the  alternatives  with  scale  factors  which  do 
not  obey  tlie  laws  of  probability.  Ellsberg’s  paradox  has  been  verified  exper¬ 
imentally  by  MacCrimmon  [1965],  Slovic  and  Tversky  [1974],  and  MacCrlmmon  and 
Larsson  [1979].  Plausible  explanations  for  this  behavior  have  been  offered 
by  Arrow  and  Hurwlcz  [1972]  In  terms  of  maxmln  choices  instead  of  maximum 
expected  utility. 
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A. 4  CONCLUSION 


Thia  appendix  has  provided  a  brief  overview  of  some  results  available 
concerning  human  decisionmaking.  The  focus  has  been  to  study  results  which 
Indicate  that  the  standard  normative  theories  such  as  probability  theory  and 
utility  theory  fall  short  of  describing  observed  decisionmaking  behavior.  The 
rest  of  the  research  in  this  report  has  been  motivated  by  the  existence  of 


these  differences. 
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ASYMPTOTIC  AGREEMENT  IN  DISTRIBUTED  ESTIMATION  WITH  INCONSISTENT  BELIEFS 

I.  INTRODUCTION 

Consider  two  agents,  1  and  2,  who  wish  to  estimate  the  same  random  vari~ 
able  X.  Initially  agent  i  (i>I,2)  observes  the  variable  (1*'1,2).  Based  on 

m  * 

his  observation  Agent  1(2)  generates  an  estimate_x^  (x^)  which  he  sends  to 
Agent  2(1).  It  Is  assumed  that  the  message  is  received  without  any  distor' 
tion.  Each  time  an  agent  receives  a  message  he'  recomputes  a  new  estimate, 
based  on  the  original  observation*  and  the  messages  received  by  the  other 
agent  up  until  that  tlme^  which  he  then  transmits  to  the  other  agent.  Several 
questions  related  to  the  evolution  of  these  sequences  of  estimates  arise: 

Will  an  agent  settle  on  a  final  estimate?  Will  the  estimates  of  the  two 
agents  eventually  agree? 

A  substantial  effort  has  been  recently  devoted  to  the  problem  of  reaching 
a  consensus  of  opinion  among  several  decisionmakers  The  crucial  as¬ 

sumption  In  (1]-(5I  Is  the  following:  All  agents  are  assumed  to  be  Bayesian. 
Agent  I's  view  of  the  world  is  represented  by  an  a  priori  distribution  p^  on 
the  space  of  “primitive"  random  variables  x,  y^,  and  p^  Is  the  same  for  all 
agents.  Under  this  assumption  the  conditions  under  which  asymptotic  agreement 
Is  achieved  are  investigated  in  [1]-E5).  A  major  result  reported  in  (3)  Is 


*We  restrict  ar tendon  to  the  case  where  each  agent  takes  one  measurement 
without  any  loss  of  generality;  the  case  where  the  agents  take  noise  measure¬ 
ments  and  communicate  simultaneously  can  be  analyzed  in  a  similar  fashion. 

**References  are  Indicated  by  numbers  in  square  brackets,  the  list  appears  at 
the  end  of  the  main  body  of  this  report. 
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the  following:  if  the  messages  exchanged  among  the  agents  are  the  conditional 
expectations  of  the  random  variable  x  then  the  agents  agree  asymptotically. 

In  this  report  w  consider  two  Cayeslan  agents  who  have  different  views 
of  the  world  and  exchange  their  conditional  expectations  of  the  random  vari¬ 
able  X.  We  show  that  asymptotically  the  two  agents  either  agree  or  they 
realize  that  they  have  different  models  and  stop  communicating  any  further. 
Agreement  or  disagreement  depend  on  the  order  of  communication. 

The  remainder  of  this  report  la  organized  u  follows:  The  model  is. 
presented  in  Section  2;  the  process  of  expectation  formation  Is  described  In 
Section  3;  the  question  of  convergence  of  the  estimates  and  of  asymptotic 
agreement  are  Investigated  In  Section  4.  Conclusions  are  presented  In  Sec- 


2.  THE  MODEL 


We  consider  two  agents,  1  and  2,  and  a  random  variable  x  which  each  agent 
wishes  to  estimate.  We  make  the  following  assumptions: 

(Al)  Both  agehts  are  Bayesian.  Agent  i's  view  of  the  world  is  repre¬ 

sented  by  an  a  priori  distribution  (lrl,2)  on  Che  space  of  the  "primi¬ 
tive'*  random  variables  X,Y^,y2,  whose  sample  space  is  0.  Agent  1  assumes 
chat  the  ocher  agent's  distribution  Is  also  Ve  do  not  assume  P^>p2. 

(A2)  Ei|x|<o  I*!, 2  where  E^  denotes  Che  expectation  with  respect  Co  Che  proba¬ 
bility  measure  p^  Induced  by  P^. 

(A3}  Each  agent  Cakes  only  one  measurement  y^  (1*1,2)  at  time  t*o*,  and  com¬ 
putes  Che  conditional  expectation  x^  of  the  random  variable  x  based  on 
his  measurement.  The  spaces  Y^  (1*1,2)  are  finite. 

(A4)  At  time  t*2K-l  (2K),  K*l,2,*»«,  agent  1(2)  sends  the  message  Xj^  (x^)  to 

agent  2(1).  Thus,  messages  are  transmitted  at  c*l,2,  in  Che  following 

-1  *2  "1  "2  *1  *2  . 

order:  ^1*^1 

The  messages  (1*1,2,  k*l,2,***)  of  each  agent  are  the  expectations  of  x 
based  on  the  original  observation  Yi  and  Che  ritessages  received  in  Che  past  by 
the  other  agent. 

*The  problem  where  the  agents  cake  more  measurements  and  communicate  simul¬ 
taneously  can  be  treated  similarly  by  considering  the  whole  sample  path 
ylt  ^  ( y^(  1 )  ,yi(2 )  ,•  •  •  ,y^  t ) )  at  any  time  t. 
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Under  assonpcions  (A1)-(A4)  we  study  Che  following  problems: 

(1)  Will  Che  esclnates  of  the  two  agents  eventually  converge? 

(2)  If  ^hey  converge  will  they  agree? 

The  model  proposed  In  this  report  Is  similar  to  that  of  {3).  The  only 
difference  between  the  two  models  lies  in  (Al).  In  (31  all  agents  are 
Bayesian  with  Che  same  view  of  the  world;  in  Che  model  proposed  in  this  report 
Che  agents  are  Bayesian  but  have  different  views  of  Che  world  (different 
P^*s).  Moreover  each  agent  supposes  that  the  other  agent's  a  priori  distribu¬ 
tion  is  Che  sane  as  his.  Thus,  Che  agents  are  unaware  that  they  have  dif¬ 
ferent  views  of  Che  world.*  Because  of  (Al)  the  same  message  has  different 
meanings  for  its  sender  and  receiver;  each  agent  interprets  his  data  in  terms 
of  his  own  model  and  generates  the  conditional  expectation  according  to  his 
prior  distribution  P^.  The  Interpretation  of  the  data  and  Che  message 
generation  by  each  agent  are  considered  in  the  next  section. 


*The  case  where  the  agents  are  aware  that  they  have  different  views  of  the 
world  reduces  to  the  model  of  (31;  the  agents  can  first  agree  on  an  a  priori 
distribution  P  by  negotiation  (according  to  the  method  of  (6))'  and  then 
proceed  to  solve  the  est imat ion  problem. 
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3.  DATA  INTERPRETATION  -  MESSAGE  GENERATION 

In  making  his  estimate  x^,  agent  i  undertakes  two  logically  distinct 

operations.  First,  he  interprets  his  current  data  (his  original  observation 

and  the  received  messages)  in  some  consistent  manner.  This  interpretation 

converts  raw  data  into  structured  information.  Secondly,  based  on  the 

*1 

Interpreted  data  an  agent  generates  an  estimate  of  r,  which  he  transmits  to 

“X  “2  “I  *2  *1  ■*2 

the  other  agent.  Thus,  a  sequence  of  messages  x^  x^  x^  x^  x^***  is 
generated  as  follows: 

At  t**!,  I’s  estimate  is 


xj  ■  E*{x|Y^}  p^  a.s. 


(3-1) 


(where  E^  denotes  the  expectation  with  respect  to  the  probability  measure  p^ 

1  *1 

Induced  by  the  distribution  P  on  Q).  The  message  x^  is  transmitted  to 
agent  2.  Agent  2  Interprets  the  message  x|  as  the  realization  of  the  random 


variable 


-  E^  (xlY^)  p2  a.s. 


(3-2) 


That  is,  according  to  agent  2  the  realization  y^  of  Y^is  such  thet 


-I  ~1  ,~1.  2 

Xj  ■  Xj  «  E  (x|y  )  p  a.s. 


(3-3) 


At  t=2,  2's  estimate  Is 


-2  _  _2,  ,„2  2 

X E  (x|Y  ,x  )  p  a.s. 


(3-4) 
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and  this  estimate  Is  sent  to  1  who  interprets  it  as  the  realization  of  the 
random  variable 

~2  _i  ,  ,„2  •!.  1 

Xj  -  E  (x|Y  ,  Xj)  p  a.s.  (3_5) 

^  *  1 

In  other  words,  according  to  1  the  data  ^  (the  realization  of  Y^)  and  X}  is 
such  that 

*2  -2  „1  -  1-2  ‘1  1 

Xf  -  Xf  -  E  (x|y  ,Xj)  p  a.s.  (3-6) 

sk  •• 

2 

In  general ,  when  agent  1  receives  a  message  xj^  he  Interprets  It  as 


-2 
’Si  “ 


‘2 


_1,  ,-2  *1  *1 

E  Ixly  .Xj.Xj,* 


1 

p  a.s. 


(3-7) 


and  then  he  generates 

“1  _i,  ,  1  -2  -2  -2.  1 

-  E  {x|y  .Xj.x^.'-'.Xj^)  p  a.s.  (3_g) 

which  he  transmits  to  agent  2. 

Agent  2  interprets  this  message  as 


-1  *1  -2,  ,~1  *2  *2  *2  *2,  2 

*k+l  “  Vl  ”  ^  **  *•**  <3-9) 


and  then  transmits 


_2,  ,2-1  -1  ,  z 

I  •  ' '*1'  ■’■.•••••Vi)  ' 


(3-10) 


to  agent  1. 


E<)uations  3-1  through  3-10  describe  the  rules  according  to  which  the 
*1  *2  *l  *2 

sequence  of  messages  *j’*j.x  x  is  generated.  These  rules  are  well- 


defined  because  of  assumption  A2 . 


4.  CONVERGENCE  AGREEMENT 


In  this  section  we  prove  that  under  assumptions  A1  through  A4  and  the 
rules  by  which  aessages  are  generated,  one  of  two  possible  events  occurs: 
Either  _ 

1.  The  agents'  estlaate t  converge  and  the  agents  agree  asynptotically, 
or 

2.  The  agents  realise  that  they  have  different  models  and  stop  com¬ 
municating  any  further. 

To  prove  the  result  we  re-examine  the  'data  interpretation'  which  is  one 
of  the  two  distinct  operations  that  each  agent  undertakes  in  making  his 
estimate. 

At  first  we  let  r  denote  the  realization  of  the  random  variable  r. 

“I  1 

When  agent  2  receives  message  he  figures  that  Y  has  realised  a  value 

am 

y^  such  that 


<  t.  (4-1) 

“2  2  -2 

Similarly,  when  agent  1  receives  message  he  figures  that  T  has  realized  y 
such  that 


"1  "2  r2/  I  -J  ...  ,  „ 

E  -  X,  ■  E  (x|2^.x,.— .X,)  VI  <  E 


(4-2) 


the  limits  of  the  estimates  of  the  two  agents.  Then,  we  have  the  following 
result. 

Lemma  4.2 

1  1*1  *'21  ,  .2 
P  ^  P 

Proof 

Equation  4-23  follows  from  Eqs.  A-19  through  4-22  and  4-14  and  4-15.  O 

So  far  we  have  shown  that  each  agent's  estimates  converge  and  moreover 
each  agent's  estimate  coincides  asymptotically  with  hia  interpretation  of  the 
other  agent's  estimate  within  the  terms  of  his  own  view  ot  the  world.  The 
next  theorem  shows  that  asymptotically  the  two  agents  agree. 


I  z  -li 
lx  -  X  )  ■ 

^  m  m* 


(4-23) 


Theorem  4.2 

Under  Assumptions  A1  through  A4  and  4-4,  the  estimates  of  the  two,  agents 
agree  asymptotically. 


Proof 

When  Eq,  4-4  is  true,  Eqs.  4-1  and  4-2  hold  for  all  k,  consequently 


*1 

X 


-1 

X 


and 


(4-24) 


Agreement  then  follows  from  Eq.  4-24  and  Lemma  4.2 
The  Investigation  of  Case  1  Is  now  complete. 

1 1 


O 


Case  2 


The  following  result  Is  true  in  this  case. 

Theoren  A. 3 

If  Eq.  4-5  is  true  the  agents  realize  that  they  have  different  models  and 
stop  communicating  any  further. 


Proof 


If  Eq.  4-5  occurs  for  agent  i  at  some  time  i,  then  agent  1  must  assume 
that  the  sequence  {y^  (Y)}^  Is  ‘’impossible**  (l.c..  sn  event  of  zero 
probability),  or  adre  reasonably  agent  i  must  assume  that  the  two  models  p^ 
and  p2  are  different.  Thus,  further  communication  is  not  necessary  (unless 
the  agent  is  willing  to  modify  his  model  and  reinterpret  the  sequence  of 
received  messages). 


Remark 

If  after  Y^  (Y)  »  ^  agent  i  is  willing  to  modify  his  model  (his  prior 
probability),  then  it  can  be  shown,  using  the  results  of  (6]  and  13),  that 
eventually  assumptotic  agreement  can' be  achieved.  The  situation  where  agents 
are  willing  to  modify  their  beliefs,;  after  they  receive  an  ’Impoesible'* 
sequence  of  messages  according  to  their  initial  view  of  the  world,  is  similar 
to  that  considered  by  Kreps  and  Wilson  (8)  who  study  dynamic  games  of  perfect 
recall  and  determine  ’*sequentlal  equilibria**  for  these  games.  In  Kreps  and 
Wilson  (8J  the  agents  modify  their  beliefs  (expressed  by  a  behavioral  strategy 
and  a  probability  measure  on  the  elements  of  every  information  set)  whenever 
an  information  set  of  measure  zero  is  reached  In  the  game. 

The  Investlgat ion  of  Case  2  is  now  complete. 


i  ■: 
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The  results  obtained  so  far  show  that  the  communicating  agents  either 
agree  or  realize  that  they  have  different  models  and  terminate  their  coiimuni~ 
cation.  The  example  that  follows  shows  that  agreement  may  depend  on  the  order' 
of  communication. 

Example  A. I 

Let  Q  -  (0,2)  X  10,31  (Fig.  4-1).  Assume  that  P*  Is  the  Lebesgue 
measure  (normalized  to  give  P(0)"l),  and  P^  is  such  that 

f2(Ci)  -  2/12,  p2(C2)  -  3/12,  -  7/12  . 

The  distribution  on  each  of  Cj,  C2.  C3  is  uniform.  Agents  1  and  2  try  Co 
estimate  1(A)  where  A  ls__shown  in  Figure  4-1.  At  t~0,  agent  1  observes 
{l(Bx),  1(82)}  and  agent  2  observes  {l(Ci),  l(C2),  1(C3)}.  Let  wcBj  O  C3. 
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Figure  4-1.  Parameters  of  Example  4.1 

Consider  first  the  situation  where  agent  1  sends  his  estimate  x|  to 

“2 

agent  2  at  times  fl ,3,S,** * ,  and  agent  2  sends  his  estimate  x^  to  agent  1  at 
times  t-2,4,6,***. 

Then  xJCw)  ■  E  {l(A)|l(Bj)}  ”  1/2.  Agent  2  receives  1/2  and  interprets 
it  as  follows.  He  belle’-es  agent  1  observed  ItB^)  because  only  then 

E^  (1(A)|1(B2)}  -  1/2  -  x[(w)  . 
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Agent  2  initially  observes  KC^).  Consequently  after  he  receives  x|(w)  his 
new  estimate  is 

-  E^{1<A)11(B2  n  Cj)}  -  3/A 

This  estimate  is  transmitted  to  agent  1  who  interprets  it  as  follows:  He 
concludes  that  agent  2  observed  1(C2)  because  only  then  E^l(A)|l(Bj  O  C2)}  ~ 
3/A.  Then,  agent  I's  estimate  is 

ijcw)  -  eMi(A)|1(Bj  n  C^)}  -  3/A 

and  this  estimate  is  transmitted  to  agent  2.  From  that  point  on  further 
communication  does  not  convey  any  more  information,  so  the  agents'  estimates 
agree  and 


Consider  now  the  situation  where  agent  2  sends  his  estimate  Co  agent  I 

*1 

at  times  t"l,3,S,***  and  agent  1  sends  his  estimate  x^  Co  agent  2  at  times 
t*‘2,A,6,«**  .  Again,  initially  agent  1  observes  l(B^)  and  agent  2  observes 
HC^).  Hence 

-  E^(i(A)I1(C^))  -  1/2 

*2 

Agent  2  receives  Xj(u)  and  interprets  it  as  follows:  He  believes  that  agent  2 
observed  either  UC^)  or  l(C^)  because  only  then 

e*(i{a)|i(Bj  n  c^))  -  e‘  (i(A)Ii(Bj  n  c^))  -  1/2 

lA 


Then  agent  I  foras  he  new  estlnate 

i}(«) -|e*1i(a)|i(Bj  n  c^))  +|e*{i(a)|i(Bj  nc^)l  -  J 

which  he  communicates  to  agent  2. 

Agent  2  can  not  interpret  consistently  this  message  lur  the  following  reason: 

He  knows  Chat  agent  1  observed  either  1(6^)  or  KB^)*  He  also  knows  Chat 
agent  I  knows  Chat  agent  2  observed  either  KC^)  or  l(C^).  Hence  if  agent  1 
observed  1(B^}  he  should  transmit  to  agent  2  the^message  x^(u)  ■  0.4;  on  the 
other  hand  if  agent  1  observed  Id^)  he  should  transmit  to  agent  2  the  message 
Xj(tt)  ■  0.6.  The  message  x^(u)  *  0.5  has  a  zero  probability  according  to 
agent  2's  view  of  the  world,  consequently,  agent  2  realizes  that  he  does  not 

have  Che  same  model  as  a^enc  1  and  any  further  communication  is  not  necessary. o 
Example  4.1  sho%fs  that  it  is  not  possible  to  partition  Che  space  Y  of 
observations  Jf  ^  (Y^,Jf^)  into  sets  Y®  Y^  such  that 

YC  U  Y®  -  Y 

f  Yc  n  Y^  -  6 

and  so  chat  if  c  Y^  agreement  is  always  achieved  whereas  if  JT  e  Y^  Che 
agents  realize  eventually  chat  they  have  different  pxidels. 

Finally,  we  restrict  attention  to  the  Gaussian  case.  By  this  we  mean 
Chat  (1)  Che  measurements  are  jointly  Gaussian,  (li)  Assumptions  A1  through  A4 
hold.  Then  we  have  the  following  result: 

Lemma  4.3 

For  the  Gaussian  case  agreement  Is  always  achieved  when  the  order  of  com¬ 
munication  is  fixed.  However,  when  the  two  agents  communicate  simultaneously, 
then  agreement  occurs  with  zero  probability. 


Proof 


When  the  order  of  connunlcacion  is  fixed  agreement  follows  from  Lemma  3 
of  (3)  and  the  fact  that  with  only  one  measurement  and  two  agents  having  the 
same  view  of  the  world  it  takes  only  one  communication  to  reconstruct  the 
centralized  estimate. 

When  the  agents  communicate  simultaneously  then,  in  view  of  the  results 
of  (3],  after  one  communication  each  agent  believes  Chat  he  has  reconstructed 
Che  centralized  estimate  (according  to  his  own  mlodel),  thus  Che  two  agents  do 
not  communicate  any  further  and  agreement  occurs  with  zero  probability. 


5.  CONCLUSIONS 


This  report  extended  the  results  of  (3)  to  the  case  where  agents  with 
different  beliefs  (different  prior  probabilities)  exchange  information. 

The  process  of  expectation  formation  for  such  systems  was  described,  and 
it  was  shown  Chat  agreement  depends  upon  the  order  of  communication  between 
Che  agents. 
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ABSTRACT 


A  team  must  agree  on  a  common  decision  to  minimite  the  crpectcd  cost. 
Differeot  team  members  have  dj.^ferenl  observations  relating  to  the  'state  of  the 
world',  and  they  mar  also  have  difrercDt  prior  beliefs.  To  reach  a  consensus 
(hey  exchange  tentative  decisions  based  on  their  curreni  information.  Two  ques¬ 
tions  are  discussed:  When  do  the  individual  estimates  converge!  If  they  con¬ 
verge,  will  a  consensus  be  reached? 


ARE 

MISSING 

IN 

ORIGINAL 

DOCUMENT 


I.  INTRODUCTION 


A  team  or  conmitice  of  S'  people,  indcxetl  i  I,  .  .  .  ,N,  must  agree  on  a  com¬ 
mon  decuioo  «  to  be  selected  from  a  pre-specined  set  t/  so  as  to  minimize  the  cost 

(I) 

where  J  b  a  real  valued  function  of  the  ‘state  of  the  world'  u  6  0,  and  the  decision  w. 
Initially,  different  people  have  different  information  relating  to  u>.  This  is  modeled  by 
stipulating  that  person  •  observes  the  value  of  the  random  variable  }'■  (w).  Every¬ 
one  knows  that  i  knows  although  i,  does  not  know  what  the  value  of  K,- 

actually  b.  Everyone  knows  the  function  J. 

Each  person  has  a  prior  belief  concerning  u.  We  stipulate  that  I's  prior  belief  is 
summarized  by  the  probability  distribution  F*  on  (H.F)  where  F  b  khe  o-ndd  of  events. 
If  *m  •  •  •  i=z  ,  wc  say  that  the  beliefs  are  comitttnt;  otberai.se  they  incon 
tUtent, 

Since  initially  different  people  have  different  information,  and  also  because  their 
beliefs  may  be  inconsistent,  their  estimates  of  the  best  decision  will  also  be  differcnl.  To 
arrive  at  a  consensus  decision  it  b  necessary  for  them  to  shore  information.  We  suppose 
that  this  information  b  shared  by  means  of  the  following  procedure. 

Consider  person  i.  In  the  first  round  he  makes  an  estimate  «,-(!)  which  is  based  on 
his  initial  data  K;,  and  he  communicates  this  estimate  to  some  or  all  of  the  other 
members.  By  the  tiuic  i  makes  his  setosd  estimate,  he  will  have  received  the  estimates 
of  some  of  the  others.  More  generally,  denote  by  f?,(f-I)  the  mcessees  received  by  i 
from  tL:  others  before  i  makes  bis  f'*  estimate  u,(0-  That  estimate  will  be  based  on 
y,-  and  We  assume  that  •  communicates  all  his  estimates  to  a  fixed  set  of  the 

other  people,  and  that  there  is  a  message  transmission  delay  of  one  time  unit. 
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Our  aim  b  to  discuss  two  questions:  Will  each  person’s  estimate  converge  as 
I  — •  oo?  If  the  individual  estimates  ccuvcrge,  will  they  reach  a  common  limit?  To  for¬ 
mulate  these  questions  mathematically,  we  need  to  specify  how  each  person  estimates 
the  best  decision  based  on  the  data  available  to  him.  This  is  done  in  Section  2.  Once 
this  is  done,  it  turns  out  that  the  answers  depend  crucially  upon  whether  the  prior 
beliefs  are  consistent  or  inconsbtent.  The  consistent  case  is  considered  in  Section  3,  and 
the  inconsbtent  case  in  Section  4.  Section  S  outlines  some  directions  for  further 
research. 

2.  ESTIMATION  SCHEMES 

Several  different  estimation  schemes  have  been  considered  in  the  literalu. 

Borkar  and  Varaiya  (2]  consider  the  situation  where  the  committM  wants  to  esti¬ 
mate  a  random  variable  X,  and  they  suppose  that  the  ('*  estimate  made  by  u,  (t),  is 
th.'  conditional  mean  of  X  given  the  available  data,  i.e., 

.  (2) 

Here  E'  deactrs  expectation  with  respect  to  P’ .  We  will  see  later  that  the  right  hand 

side  of  (2)  has  to  be  interpreted  carefully  when  the  beliefs  are  incoDsbtent.  For  the 

moment  observe  that  the  estimate  given  by  (2)  b  abo  the  decision  that  minimites  the 

(expected  >  of  vuc)  cost  fasetion 

when  the  information  available  b  { F,  ,  Z7,  ((-!)}.  Aumann  {l},  and  Geanakoplos  and 
Polemarchakis  |4)  consider  the  situation  In  which  the  group  wants  to  estimate  the  proba¬ 
bility  that  a  particular  event  F  ^  F  has  occurred.  This  b  a  special  case  of  (2)  with 
X  1(F).  The  set  U  of  all  possible  states  is  Hnite  is  (l)  and  {^J.  Tsitsiklis  and  Athans 
j/j  consider  the  situation  described  in  the  introduction.  Sebenius  and  Geanakoplos  [5] 


£ 


; 


*.(l)-£{A'|  J:.(i-1))  .  H) 

Let  £,{oo)  Since  »  an  SociCAsins  sequence,  it  follows  from  the  mar¬ 

tingale  convergence  theorem  that 

«.(0-.w.(oo)«.s.;«.(co):=£{A'|  r,(oo))  .  (5) 

Thus  the  individual  estimates  do  converge. 

Next  we  investigate  whether  the  limiting  estimates  agree.  Suppose  i  communicates 
his  estimates  to  /.  Then  «i(0  is  J^(l-H)-measurabIe.  From  (5)  it  follows  that  u,(oo)  is 
J^  foo)- measurable,  and  so, 

«i(eo)-£{«,{oo)|  KiMniSM)  •  (0) 

Suppose  there  is  a  ecmmmnicmtton  ring  I'l,  .  •  •  ^  'i.  This  b  a  not  necessarily 

distinct  sequence  of  persons  such  that  i|  communicates  his  estimates  to  i|^|.  Then, 
according  to  (0),  we  must  have 

•h(oo)  —  ®  I  n  —  1,  . . .  ,n  ,  (7) 

where  1V4.1  quite  easy  to  show  (2,  Lemma  2)  that  (7)  implies 

- • 

so  that  the  asymptotic  estimates  of  the  members  of  a  communication  ring  agree.  This 
suggests  the  main  result  of  [2]: 

Theorem  1. 

If  the  estimates  of  1  are  given  by  (2),  then  each  person’s  estimate  converges.  More¬ 
over,  if  everyone  in  the  team  b  a  member  of  the  same  communication  ring  then  the  lim¬ 
iting  estimates  agree. 


Proof: 
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Tenekctiis  (8|  show  siso  that  the  a^ccment  conditioo  for  rinp  is  satisfied  bj  decision 
rules  which  are  optimal  in  the  sense  defined  below: 

Proposition  i. 

Suppose  lhat  d  is  a  decisiot  rule  such  that  o(d(£*  ))  C  for  all  o-ficlds  C  £. 
Then  d  satisfies  the  agreement  condition  for  rings  if  and  onijr  if  there  is  a  partial  order- 
ing  <  of  the  set  of  funct’ons  {d(F*  ) :  F*  C.  F)  such  that  d(F*  )  is  the  maximum  cle¬ 
ment  of  {d{G)  ‘  G  C  F,  tr{d{G))  C  £*  )  with  respect  to  <. 

Proof: 

See  Appendix  C. 

In  man;  eases  as  in  (2],  (7]  the  partial  order  relation  is  defined  in  terms  of  a  scalar 
cost  function.  The  following  proposition  proves  that  decision  rules  defined  by  such  cost 
functions  satisfy  the  agreeuent  condition  for  rings,  provided  that  the  decision  includes  a 
tie-breaking  rule  of  the  cost  function  has  more  than  one  minima. 

Proposition  2. 

Suppose  that  the  decision  functions  take  values  in  a  set  U.  Let  L  be  a  real-valued  func¬ 
tional  of  P-measurable  decision  fnnctio>ns  d  :  fl  — »  I/.  For  each  F*  let  £)(F*  )  be  the  set 
of  F*  -measurable  decision  fonctions  i  such  that  L(6)  <  L{F  )  for  all  measurable 
f  :  n  — •  (/.  Assume  lhat  U  is  partially  ordered  by  <'  and  that  for, each  F*  ,  there  is 
a  i  6  D{F*  )  such  tha;  F  6  OfF*  )  implies  6'  (w)  <'  for  all  The  decision 

rule  d(f*  )  which  assigns  thb  S  €  D(£*  )  to  ^  satisfies  the  agreement  condition  for 
rings. 


Proof: 


3.3.  COMMON  KNOWTLEDGE 


The  main  feature  of  the  eatimalion  schemes  presented  in  (i),  (2|,  (4],  |8]  is  the  fol* 
louing: 

If  all  team  membt  ra  i.3C  the  same  detision  rule,  if  everyone  in  the  team  is  a  member 
of  the  same  communication  ring  and  if  common  knowledge  decisions  agree,  then  all  team 
members  agree  on  the  same  decision.  The  common  decision  is  the  decision  based  on  the 
ultimate  common  knowledge  (common  information)  of  the  team  members. 

Thus,  it  appears  appropriate  to  define  common  knowledge  at  this  point,  and  to 
show  that  the  definitions  of  common  information  given  in  (l),  [2],  (8],  (9]  are  essentially 
equivalent  and  lead  to  the  same  results. 

Aumann  (l]  represents  information  by  a  partition  P  on  the  sample  space  fl. 
Elorkar*V'araiya  (2]  and  Washburn*Tenckettis  (8]  represent  information  by  O'fields  con* 
tained  in  F.  It  can  be  shown  that  these  two  representations  are  essentially  equivalent. 
The  partition  P  is  a  collection  {E  I*  ^2»  * ' '  }  of  mutually  disjoint  events  whose  union  is 
the  whole  sample  space.  To  a  partition  P  there  corresponds  a  unique  (r-field  F,  namely 
the  O'field  generated  by  the  events  in  P.  Each  Ei  ^  P  is  an  atom  of  F.  if 
P|  {£|,  Ej,  '  '  '  }  *Dd  Pj  n  (fitj*  Ci,  ’  '  *  }  then  one  can  define  a  third  partition 
P}  which  is  the  finest  partition  contained  in  Pj  and  Pj  and  is  denoted  by  P|  AP..  if 
P|  and  P;  correspond  to  the  o-fields  Pj  and  P;  then  P|  APj  corresponds  to  Pj  AP;. 
Aumann  |l]  defines  an  event  E  to  be  common  knowledge  to  team  members  1  and  2 
(with  information  P,  and  P;  respectively)  at  u  if  there  is  an  atom  G  ^  Py  AP;  such 
that  or  G  C  C  P.  If  Pi  and  Pj  are  the  o-fields  corresponding  to  P,  and  P,  respective¬ 
ly,  then  the  definitions  of  common  knowledge  at  u  given  ia  [2],  (8],  namely  that  there  is 
C  G  P|  APj  and  w  G  G  CP,  are  equivalent  to  Aumann’s  definition.  Let  us  say  that 
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tbe  creot  E  is  common  knowledge  to  the  team  members  1  nod  2  if  it  is  common 
knowledge  nt  ench  u  ^  E.  Then  E  is  common  knowledge  to  I  nnd  2  if  nod  only  if  it  be¬ 
longs  to  the  o-field  genernted  bj  P,  APj,  nnmely  f,  aFj. 

Milgroro  |9]  characterizes  common  knowledge  by 

(i)  associating  with  each  event  E  another  ev'ent  with  the  interpretation 

Kg  {  uf  g  n  :  £  is  common  knowledge  at  ut) 

and 

(ii)  considering  the  following  four  conditions: 

(Cl)  Kg  C  E 

(C2)  V  ^^eh'g,  V  i.  if  w  6  F. .  F.  C  Kg 
(C3)  £,  C  Fj  A'e,  C  Kg^ 

(C4)  (  V  V  “  6  E,  if  w  €  Fi‘,  F^  €  FJ  -$»•  E  —  Kg. 

Condition  (Cl)  asserts  that  an  event  £  u  common  knowledge  only  if  it  actually  occurs. 
Condition  (C2)  implies  that  if  £  is  common  knowledge  then  every  team  member  knows 
that  £  is  common  knowledge.  Conditions  (Cl)  and  (C2)  imply  that  £  is  common 
knowledge  only  if  £  occurs,  each  team  member  knows  £,  each  knows  that  all  know  £ 
and  so  on.  Condition  (C3)  implies  that  wherever  £|  is  common  knowledge  any  logical 
consequence  of  £|  is  also  common  knowledge.  Condition  (C4)  asserts  that  public  events 
are  common  knowledge  whenever  they  occur.  A  public  event  is  defined  by  the  an¬ 
tecedent  in  (O'!);  it  is  an  event  which  if  it  occurs  will  be  known  to  every  team  member. 
Milgrom  ([9])  shows  that  his  characterization  of  common  knowledge  is  equivalent  to 


Aumann’s  dennition. 
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D»tioo  ID  D  that  can  be  stated  succioctly  ttsiog  commoD  knowledge:  a  cootingcot  al¬ 
location  /  is  crTicient  if  there  is  no  other  allocation  v  such  that  it  is  common  knowledge 
that  all  agents  prefer  f  to  v.  Milgrom  ({Oj-jlO])  used  the  idea  of  common  knowledge  to 
analyte  a  rational  expectations  trading  model.  He  showed  that  when  traders  exchange  a 
risky  security  on  the  basis  of  private  information  then  they  “agree  to  disagree."  (i.e.,  no 
trade  takes  place).  Keeps,  Milgrom,  Roberts  and  Wilson  (12)  consider  finite  repetitions 
of  the  well-known  prisorers*  dilemma  game.  A  common  observation  in  experiments  in¬ 
volving  finite  repetitions  of  the  prisoners*  dilemma  is  that  players  do  not  always  play  the 
single  period  dominant  strategies  but  instead  achieve  some  measure  of  cooperation. 
Keeps  and  hu  co-authors  in  (12)  show  that  the  lack  of  common  knowledge  about  one  or 
both  players'  options,  motivation  or  behavior  can  explain  the  observed  cooperation. 

4.  INCONSISTENT  BELEFS: 

The  analysis  b  quite  different  when  the  beliefs  are  inconsistent.  The  discussion  in 
this  section  is  initially  ^ased  on  Teneketris  and  Varaiya  [6].  Then  the  results  of  [0]  are 
extended  to  the  case  of  a  general  decision  rule  i.  To  keep  the  notation  simple  assume 
there  are  only  two  persons.  Alpha  and  Beta,  Initially,  Alpha  observes  the  random  vari¬ 
able  A  and  Bela  observes  B.  Both  wish  to  estimate  the  random  variable  X,  We  also  as¬ 
sume  that  n  b  finite.  The  prior  probabilities  of  Alpha  and  Beta  are  denoted  P“, 
respectively. 

For  t  =*  1,2,  •  •  •  the  if**  estimate  by  Alpha  {Beta)  is  denoted  o,  {0, ).  o,  is  the 
conditional  expectation  of  X  given  the  observations  A,0i,  .  .  .  Alter  a,  has  been 

calculated  it  is  communicated  to  Beta  whose  estimate  is  the  conditional  expectation 
of  X  given  B,  O],  .  .  .  ,Oi .  Once  0^  is  evaluated  it  is  communicated  to  Alpha  who  incor¬ 
porates  it  into  the  estimate  U|^{,  and  the  procedure  is  repeated. 


To  complete  the  •pecificatioo  we  oasume  lh»t  the  ealimklion  procedures  followed 
by  Alpha  ond  Beta  ur  coosistent  with  their  own  prior  models.  Thai,  is,  e^h  assumes 
the  other's  model  to  be  the  same  as  his  own.  Consider  Alpha.  When  he  receives  Deta'i 
estimate  Alpha  interprets  it  as  if  it  were  based  on  P“  rather  than  on  P^.  Thus  Al¬ 
pha  assumes  that  Beta’*  estimate  is  a  realitation  of  the  random  variable 

fit-t  E^{X  \  B,  a^,  .  .  .  ,0{,|)  . 

Subsequently,  d/pAa  calculates  Oi, 

Sf  £*  {X  I  d,  fii, . . .  • 

Symmetrically,  Beta  interprets  Oi  as 

and  calculates  fit  by 

fi,  {Jf  I  B.  . . . 

There  'is  a  more  revealing  description  of  the  functional  dependence  of  these  esti> 
mates.  Suppose  a  particular  realitation  <7  (d,  B)  has  occurred.  Since  Alpha  observes 

d ,  he  concludes  that  u  6  H  *  :■>  {(d,  B)  |  d~d }  and  so  his  first  estimate  equals 

5,  -  £“{Ar  I  d-d)  -  £:“{jr  I  w e  n,“) . 

Alpha  transmits  the  number  0|  to  Beta.  Beta  interprets  it  as  a  realitation  of  the  random 
variable 

o, -£^{X|d), 

and  so  he  infers  that  w  6  fl/  {w  |  o,(w)  —  o,,  B  =  B),  and  his  first  estimate  takes 
the  value 

fit^  E‘’[x\uren‘>] . 

This  value  is  communicated  to  Alpha. 


14 


At  the  bcgioniog  of  the  ('*  round.  Alpha  stnrts  with  the  inference  u  g  n*|  when 
he  receives  the  estimate  He  interprets  it  as  a  realitation  of  the  random  variable 

^*-1  *  £*{A  I  B,  0|, .  . . 

A  w 

and  so  Alpha  concludes  that  Zi^  O*  {u  |  b;€  H*),  Hence  Alpha’t 

('*  estimate  takes  the  value 

o,  -  £*{jir  I  w  €  nn 

which  is  ccnmunicatcd  to  Beta.  Whereupon  Bela  interprets  it  as  a  realitation  of 

0|  '  (Jf  I  A,  £|,  . .  .  ffit-i)  t 

concludes  that  u  €  H/  (w  |  m  6  Hif,,  «!((•;)  ■■  a, )  and  evaluates  his  ('*  estimate  as 

A  -  E^{xio,en,^} . 

Thus,  as  expected,  the  uncertainty  diminishes  with  each  exchange,  n|*|  C  H”, 
n,*i  C  n,^.  From  the  description  above  we  also  see  that  if  for  some  k  cither 
fii*,  —  n*  or  n/^i  *  n/,  then  fl*  ■=»  OiVi  and  (i/  ■»  fof  t>k+L  Hence  for 
I  >  r  (which  cannot  exceed  the  number  of  distinct  elements  in  0),  0  *  and  Cl/  become 
constant.  These  limit  xts  depend  upon  the  realitation  u.  Call  them  n>‘*(u')  and 
respcctivcir. 

Tkere  are  two  possibilities.  The  first  u  that  n*(ar) »  ^  and  n/(u;)  =■  d-  This 
happens  because  at  some  stage  the  message  received  by  Alpha  is  “impossible:”  there 

A  ww 

is  no  w  such  that  or, the  message  0|  received  by  Beta  is  “impossible:” 

there  is  no  D  such  that  0|(h/)>>»0, .  Alpha  and  Beta  must  realiie  that  their  prior 
models  are  inconsistent.  Let  Of  be  the  set  of  all  realitations  that  lead  to  this  outcome. 

The  second  possibility  b  that  n  ®(w)  ^  and  n/(ti;)  jA  d-  in  this  case  for  I  >  T 

A  A  •  « 

the  estimates  stop  changing:  (w)  ■*  ^‘(w),  o,(u;)  =»  o.(w),  0((u;)  >=  a«(u), 

■>»  Since  for  every  I,  it  follows  that 


■V  ■> 


On  ihe  other  hand,  since  0,  and  a,  arc  based  on  the  same  model,  namely  P**,  it  follows 
from  Theorem  1  that  ^•(w)  >=  a*(a;).  For  the  same  rcasob  o«(u;)  •==  Thus  if 

u  6  0//  :=  n  -  f)/  ,  there  is  agreement  Of(<j)  =  ^i(w)  for  I  >  T..  It  ii  worth  empha$i:- 
ing  that  thi*  agrtement  need  not  be  a  reflection  of  the  eonsitteneg  of  the  two  modelt  , 
P^.  Rather  agreement  oeeurt  because  within  each  person's  model  there  is  sufficient  "un> 
certaintg"  to  permit  the  reconciliation  of  the  other's  messages  with  his  awn  observation. 
One  might  sag  that  agreement  could  result  from  two  wrong  arguments.  Vft  summariie 
the  preceding  analysis  as  follows; 

Theorem  3 

The  set  of  events  Cl  decomposes  into  two  disjoint  subsets  fl;  and  fl//.  After  T  ex¬ 
changes,  if  u;6  H/  both  agents  realise  theiir  models  are  inconsistent,  whereas  if  w  G  Cl/j 
the  two  estimates  coincide. 

The  result  u  fragile.  In  particular,  whether  a  realisation  w  ends  in  agreement  or  in 
impasse  can  depend  upon  the  order  of  communication  between  Alpha  Beta  as 
demonstrated  by  the  following  example: 

Example 

Take  fl  =»  [0,2]  X  (0,3j,  suppose  A /pA a  observes 

A  -  {l(a,),  !(«,)} 
and  Beta  observes 

/?=  {1(6,),  1(5,),  1(6,)} 

and  suppose  X  is  the  indicator  function  of  the  shaded  region  as  shown  in'  figure  1.  As¬ 
sume  that  is  uniformly  distributed  under  P**,  whereas  under 
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P^{6,)-2/12,  P^(65)  =  3/12.  P^(6,)  =  7/12 

u 

and  within  each  6,,  w  is  UDifornily  distributed.  Suppose  wG  o,  D  63  and  that  Alpha 

m 

iji*  i 

/  . 

communicates  first.  Then 

JT.  =  £:(XK«,)  =  1/2 

m 

9* 

V  "vV 

/}e(d  interprets  this  as  a  rcalitation  of 

£(A'|l(«, )»(«:)) 

Vi: 

Sinee  =■  S/12,  E\x\ue*i)  =*  1/2,  upon  learning  that  a,  =  »/2,  Bela  con¬ 

cludes  that  w  G  U},  aud  since  he  has  observed  that  {3  G  S  j  his  estimate  is 

m 

’  M 

*“  V 

fit  -£^(^1^6  «,nSj)-3/4 

j/  V 

/ 

/ 

yt/pAa  interprets  ^1  as  a  rcalitation  of  £*(*|wGfli,  fl)-  Since 

n 

£*(s|a;€a,n  6,)-  i/2, 

£•  (j|i.<Ga  ,n  62)  *=«  3/4, 

r*  T 

d 

£'(x|'.^a,n  6,)-  1/2, 

Alpha  concludes  that  u  G  UinS},  hence 

a2  =  £*(JirKfl,n62)  =*3/4  . 

fe 

Evidently,  ^  aj  =  a,  •=••••=  3/4  and  there  is  agreement.  (Note 

m 

• 

that  Alpha  believes  that  «  G  o,nS*,  Beta  believes  that  wG«2nSj,  in  fact  uJGa  iflS,. 

K‘*‘ 

J-.v 

f.'i 

Now  suppose  again  that  wGaiftSs,  but  this  time  Bela  communicates  first.  Then 

•*.  1 

h  a-eS 

m 

fit  -  £^(Jrla;G6,)  -  1/2 

t  ■>' 

! '  .'■* 

Since 

*  ..s 

£:'(A'|a^S,)=.  1/4, 

£*(.V|wG62)=- £*(A'KSj)-  1/2, 

Si 

upon  learning  fit  =  1/2,  Alpha  concludes  that  wGSjUS  j,  thee  his  estimate  is 

Si 

at  -»  £‘(ArK««in(62USj))  •=-  1/2  . 

But  Beta  expects  o’!  to  take  on  the  value 

»  n'-  ' 

■ 

li 

!  ■  .  ■  •  ' 
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,  £'’(Alwea,n(6;Ui,))  ■=  0.4 
or 


£'(AV€«,n(6jU6j))  =  0.0  . 

Thus  Beta  concludes  that  the  models  are  inconsistent. 

The  results  of  Tenekettis  and  Varaiya  (6]  can  be  extended  to  the  case  where  the  de¬ 
cision  rule  is  a  general  function  d  as  in  section'3.2.  We  discuss  this  case  next. 

Assume  the  same  model  as  in  Tenekettis  and  Varaiya  [O]  and  suppose  the  estimatss 
•(  and  fit  ve  generated  by  the  decision  rule  d  given  the  observations 
d,  fit,  fit,  ••  •  ,fii.\  B,  02, .  .  .  respectively.  Suppose  the  decision  rule  d  sa¬ 
tisfies  the  cgreemtnt  condition: 


for  allff,,  CjC  £, 

«r(rf(C2))  C  C  ^2  •=#>  rf(C;). 

Under  the  assumptions  above  one  can  prove  the  following  result: 

Theorem  4. 


If  n  is  finite  and  the  decision  rule  i  satisfies  the  agreement  conrlition  (15)  then  ei¬ 
ther  the  estimates  c  and  fi  agree  after  a  finite  number  of  comm*'.nic.'.'ii'ns  or  Alpha  a^d 
Bela  realite  that  their  models  are  inconsistent. 

Proof: 

See  Appendix  F. 

As  pointed  out  in  the  discussion  previously  the  investigation  ot'  convergence  and 
agreement  of  the  estimates  can  proceed  in  two  step: 

1.  Determine  what  each  team  member's  Ciodcl  predicts  about  the  evolution  and 
the  outcome  of  the  estimation  process. 
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2.  Examine  bow  these  predictions  compare  with  what  actually  happens  durinf  the 
estimation  process'. 

For  finite  n,  the  result  of  theorem  4  is  true  for  rules  that  obey  the  agreement  con¬ 
dition  for  a  very  simple  reason.  If  a  team  member’s  view  of  the  world  is  consistent  with 
reality,  (ben  agreement  must  result  after  a  finite  nuisber  of  communications  because  this 
is  what  is  predicted  by  the  team  member's  model;  anything  else  would  be  inconsistent. 

6.  CONCLUDING  REMARKS 

Recall  the  discussion  in  Section  3.1  and  3.2.  There  a  consensus  b  reached  via  a  se¬ 
quence  of  exchanges  of  tentative  deebions.  The  information  available  to  a  person  in¬ 
creases  with  each  message  exchange  and  the  limiting  consensus  decision  b  based  on  the 
information  common  to  ail  in  the  sense  that  d|(oo)  •«••••  ■■  djy(oo)  b  measurable  with 
respect  to  2^|(oo)n  •  •  •  nj^>v(oo)-  A  consensus  can  also  be  reached  if  ail  people  share 

their  initial  private  data  1 . We  may  call  thb  consensus  the  /uff  •n/srnis/ien 

decision.  It  turns  out  that  the  consensus  reached  by  exchanging  tentative  deebions  need 
net  coincide  with  the  full  information  decision.  However,  within  a  rather  simple  model, 
Geanakoplos  and  Polemarchakb  |4)  have  shown  that  the  two  deebions  are  "almost  al¬ 
ways”  the  same.  It  would  be  worth  investigating  thb  in  a  more  general  setting. 

Secondly,  even  when  the  two  deebions  are  the  same,  it  does  not  follow  that  all  peo¬ 
ple  obtain  the  full  iaforrsation.  i.e.,  it  need  not  be  the  case  that 
J^i(oo)  (f[y  i,  .  ■  .  ,Ys)’  If  “  *  proper  subset  of  ir{  K,,  .  .  . ,  }.  then  one 

could  argue  that  reaching  consensus  via  exchange  of  tentative  decisions  requirrs  a 
transfer  of  leas  information  than  the  exchange  of  all  private  infonnation.  This  too  is 
worth  further  investigation. 


10 


llccall  DOW  the  discuasioD  deaiiog  with  the  c»se  of  incoDsistcnl  beliefs  The  most 
interesting  fioding  is  that  Alpha  and  liela  can  exchange  alatcments  about  A'  and  eventu¬ 
ally  agree  even  when  their  views  are  different.  Thus  paradoxically,  the  realitation  that 
these  views  are  different  is  only  reached  when  further  communication  becomes  impossi¬ 
ble.  This  raises  several  basic  and  knotty  issues  that  need  further  investigation. 

One  can  readily  imagine  situations  where  the  most  important  thing  is  to  determine 
whether  or  not  the  beliefs  are  inconsistent.  In  the  communication  setup  of  Section  4  the 
realitation  that  beliefs  are  inconsistent  is  fortuitous— It  happens  only  if  Alpha  and  Bela 
reach  an  impasse.  How  should  one  structure  the  set  of  knessage  exchanges  so  as  to  ex¬ 
pedite  the  reaching  of  an  impasse! 

SuppKsse  now  that  Alpha  and  Beta  do  reach  an  impasse  (u^O;).  Our  analysis  stops 
at  this  point,  but  there  are  two  directions  that  can  be  pursued.  First,  observe  that  with 
the  realitation  that  ibeir  beliefs  are  different  comes  the  nnderstaoding  that  they  hare 

^  a  ■ 

•‘misread'*  each  other's  messages  (i.e.,  they  now  know  that  ijb  and  o,  ^  o,),  and 
consec^uently  their  estimates  have  been  "biased."  To  eliminate  this  bias  each  needs  to 
learn  what  the  other’s  view  b.  A  straightforward  way  of  permitting  such  learning  is  to 
suppose  that  from  the  beginning  Alpha  admits  that  Bela’a  model  might  be  any  one 
of  a  known  set  of  models  and  there  b  a  prior  distribution  on  P^  reflecting  Alpha’a 
initial  judgment  about  Dcla'a  model;  a  symmetrical  structure  is  formulated  for  Beta. 
A\^ithin  such  a  framework  it  seems  reasojable  to  conjecture  that  each  agent  will  correctlv 
read  the  other's  message  and  hb  serquence  of  estimates  will  converge.  But  if  their  models 
are  different  then  the  litniting  estimates  tnav  differ,  and  a  consensus  will  not  emerge. 

Suppose,  however,  that  .ilpha  and  Beta  want  to  reach  a  con.sensus.  To  reach  a  con¬ 
sensus  one  or  both  must  cba.oge  their  roolets.  One  can  imagine  msnv  different  wavs  in 
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which  this  can  be  done.  For  example,  De  Groot  [3]  proposes  that  each  person  tells  the 
others  what  his  prior  probability  is,  and  he  proposes  an  ad  hoc  behavioral  rule  whereby 
each  person  adjusts  his  model  to  a  weighted  average  of  the  others’  models.  This  u  not 
very  satisfactory  in  situations  where  commun'tattng  one's  prior  beliefs  is  not  practicable. 
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APPENDIX  A 


I 


Proof  of  Theorem  1 


Coovergence  of  each  inrmber's  nlimatea  follows  from  the  Martingale  Convergrnrr 
Theorem.  The  proof  of  the  rest  of  the  Theorem  proceeds  in  several  sle|>s.  Consider  two 
agents  i  and  /  and  let  C,(f)  denote  the  o-field  generated  by  the  transmission  and  recep¬ 
tion  of  messages  from  agent  •  up  to  time  (.  That  is, 


C.(0  “  ■.(•).  •  •  •  . 

•  •  •  >•,(<-1)) 

C;,(<)  —  <r{u,(l) . WiC*-!).  .  -  •  ,«Vi(l),  .  .  .  Vi(l),  .  -  .  , 

Vi(»-I) . «;i) . -^2) . 

Define  S'*  to  be  the  event  that  agent  •  sends  messages  to  /  inrmiicly  often.  Then 
Lemma  Al 

Both  «,(oo)l(5*)  and  «^(oo)t(5*)  are  common  knowledge  for  C,(oo)  and 
C^(oo).  Moreover, 


«.(oc)  1(5  •  )  -  E{X\G,{oo)  n  1(5  ■  )  «.». 


«,(oo)  1(5)  —  w,{oc)  1(5) 


5-5’  n  5'* 


Proof 


Since  ttere  is  »  mrsssee  transmission  delay  of  one  unit,  it  follows  that  5’  is  in 
G",|oo)  and  6y(oc),  Since  *,(l)  is  (t+ 1)- measurable  it  follows  u,(oo)  is  f7^(oc)- 
raeaaurable.  Similarly  (oo)  is  (7,  (oo)- measurable. 
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Consrqucoily, 

«.  (oo^  US'* )-  ^A'lC.  |oo)  n  c,  (oo))  . 

«,{oo)  l(S*' )  -  £:t,V|C7.(oo)  n  c,{oc))  US*' )  . 

Hcnfc, 

ii,(oo)  U-^)  “;(oo)  l(‘^) 

■ 

To  pro<ccd  further  »e  need  the  followtoc  result: 

Lemma  A2. 

Let  ii  r j,  .  .  .  ,r,*i  •“  ii  be  random  rectors  and  fj,  .  .  ,  be  cr-fields  such 

(hat 

r.  -  I  £. )  .-1,2 . (A.-l) 

Then 

r ,  1,  X,  ■■  •  •  •  •■  r,  (A  S) 

Proof 

We  can  assume  that  i;  are  Kaiars,  since  by  applying  the  some  argument  to  each 
component  we  can  generalite  the  result  to  random  rectors.  Suppose  first  that  each  x,  is 
square  intcgrable.  Since  conditional  expectation  b  the  best  mean  square  estimate  and 

f'(-,.i|P,  )•  follows  that 

£|x.p+  £|.V,  -  x.l-  .  -1.2 . n 

Addinc  llif  al>ove  relations  and  using  x.,,  ■=»  :  ,  we  get 


pmn  I 


Con vrjtjrntly  t,  “  x r ,—••••  •=•  x  Thus,  lemma  A.  hold  for  square  iotegrahlr 
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random  variable.  To  complete  the  proof  of  l>:inma  A2,  for  any  number  K  let 
=  min  .  K).  Then,  by  Jensen's  inequality,  *=  )  implies 

V.  (A.6) 

The  la.si  inequality  implies 

£  **r' >  £ >  •  •  •  >  ^ .-jvi  “  .-r . 

Consequently,  (A.fl)  holds  with  equality. 

Therefore,  for  ib,  >  ib. 


/‘•-  r.**-  E(r*',  - 


Since  r,  '  -  r,  *  is  bounded,  it  is  square  integrable,  therefore 


-  i, 


*9  ^  y 


r*-r‘* 

'  s,  « 


Lemma  2  follows  by  letting  A|— oo  and  ikj— oo. 


Lemmas  Al  and  A2  can  now  be  used  to  prove  the  following  result 
Lemma  A3. 

Suppos'-  th-vt 

(')  '  i>  ‘j,  ■  ■  •  >'««!  '•  form  a  communication  nog  for  5,  and 

(ii)  1(5)  is  common  knowledge  for  G,(oo).  C:(oo),  .  .  .  ,C,(oo). 

Then  u,  (oo)  agree  on  5,  i.e., 

u,('V3)1(5)  —  «:(oc)  1(5)  -.•••  —  u,(ro)l(5)  a.*.  (A.7) 

Proof  '  . 


Bv  Lemma  A  I 
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Si 


ts 

I 


r' 

L- 


n 


«.(c»)  1(5<-")  -  a:(A'|c.(«»)  n  c.„(oo))  - 
»  (00)1^7. (cx.)  n  ^7.o(oo))  I(5<-'>)  =  (A  S) 

=  a:{«.m(oo)  l(5<'‘")lC.(oo)n  C.^.Ioo)) 

By  hypothesis  (ii)  SCC,{oo)  and  SCS^'*'^  Multiplication  of  both  sides  of  (A.S)  by 
1(5)  gives 

«.(oo)  1(5)  «  £{u^,(oo)  1(5)  |C.(cx3)  n  <7,+  i(oo)}  (A.9) 

Eq.(A.9)  and  Lemma  A2  imply 

«i,(oo)  1(5)  -=  •ij(cx3)  1(5)  •  •  •  —  «/oo)  1(5) 

Lemma  A3  can  now  be  used  to  prove  the  following  result. 

Lemma  A4 


Under  the  hypothesis  of  Lemma  A3 

«,(oo)  1(5)  =  £{,V|(7,(oo)  n  C^oo)  •  •  •  n  C,(oo))  1(5)  (A.IO) 

Proof; 

By  (A.9) 


By  Lemma  A3 


«.(oo)  1(5)  -=  £{A'l(5)|C.(oo)  n  C.*,(oo)} 


(A.ll) 


«i,(oo)  1(5)  =  u,(oo)  1(5) , 

thus,  u,(oo)l(5)  is  common  knowledge  for  Cr|(co),  (/^(cio),  .  .  .  ,C^oo).  Taking  condi¬ 
tional  expectation  with  respect  to  (7|(oo)  n  C^oo)  •  •  •  fl  C,(oo)  we  obtain 


u.(oo)  1(5)  =  £(A-|i5)|f7,(oo)n  C:(oc)n  •  •  •  nC^oo)) 

=  £{A'|<7,(oc)  n  f7,(oo)  ■  ■  ■  nC,(oo))  1(5) 
since  by  hypothesis  1(5)  is  common  knowledge  for  (7,(oo),  (7j(a:) . C,(oo).  ■ 

The  assertion  of  Theorem  I  now-  follows  from  Lemma  A't  since  1(0)  is  common 

knowledge  for  all  persons.  The  estimate  of  e.ach  agent  converges  to 

'£{A'j(7,(oc.)  n  6\4oo)  •  •  ■  n  (7,(oo)). 
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APPENDIX  C 


Proof  of  Proposition  1 

At  first  we  show  that  if 

dir  )  ^  m^x  {  J(G):  G  C  F  ,  o{J{G))  C  r  )  (C.l) 

then  d  satisfies  the  agreement  condition  for  rings.  Suppose  that  a{d{F*  ))  <Z  G  C.  F'  . 

Because  ^  C  £*  ,  d{G)  <  d{F*  ).  Since  »(</(£*  ))  C  C,  it  is  clear  that 

d{r  )  €  {d(H)  :HCF,  oid{H))  C  C).  Thus,  d(r  )  <  d[C).  The  relation  <  is  a 

partial  order,  consequently  d(C)  <  d[P  )  and  )  <  d{G)  imply  d(F'  )  =  d(C). 

Hence  d  satisfies  the  agreement  condition  for  pairs  and  in  particular 

d(a(d[P  )))  *=  d(£*  ).  Suppose  •=£,+!  »nd  o(d(f)v'))  C  1  S  ^  ^ 

Then 

d{^)-^d(c{d(FJc)))<d[^^,) 

for  each  K,  hence  d{Fx)  <  d{Fj()  <  d(P,4.i  “  »od  so  ^(£1)  *=  d{Ff{ )  for  all  K. 

This  shows  that  the  decision  rule  d  defined  by  Eq.(C.l)  satisfies  the  agreement  condition 
for  rings. 

Conversely,  suppose  that  d  satisfies  the  agreemtwi  condition  for  rings.  Define  the 
partial  order  <  on  {d{F^  )  '  £*  C  £)  »*  follows:  Write  d(£|)  <  ^(£;)  and  only  if 
there  is  an  integer  ij  >  1  and  <r-ficlds  G^  C  £,  1  <  K"  <  17,  such  that  tT(d<F,))  C  G^, 
<r(</(C^-))  C  Gk*!  and  ^(G,)  «=  It  is  easy  to  see  that  d{^  )  <  d(F'  )  for  all 

£*  C  £,  (hence  <  is  reflexive),  and  that  d(£,)  <  d(£;)  and  </(£;)  <  d(Fj)  imply 
</(£i)  <  d[F^)  (hence  <  is  transitive).  Suppose  <f(F|)  <  <f(£^)  and  dlF^}  <  d(F,}. 
Then  there  are  <r-fields  C  £,  1  <  A'  <  17  +  m,  such  that  ))  C  C,. 

ff(d(^))  C  G^-^,,  I  <  F  <  q  +  m  -  I.  diF,)  -  rf(G,,,  ),  and  rf(F.,)  =  d(GJ.  The 
agreement  condition  implies  that  d(^- )  •=  d(G,)  for  all  A",  therefore  d(F,}  ■=•  d{F-<). 


CoDsrqucntly  <  is  aolisynimctnc  »nd  so  <  is  »  partial  order.  Finally,  ^  C  f  and 
<T(rf{CV))  C  r*  .  then  d{C)  <  d{r  )  by  definition  of  <.  Hence,  d{f^  )  is  the  tnaximurn 
clement  of  {<f(C)  =  £  C  £,  C  £*  )  with  respect  to  <. 


20 


APPENDIX  D 

Proof  of  Propoillion  2 

Suppose  n  —  y  are  P-measurable.  Define  S,  <'  '  S,  to  mean  either  that 

L{«j)  <  L(<,)  or  that  L(S..)  «  L(6,)  and  6,(u»)  <’  for  all  w.  It  is  easy  to  sec  that 

<'  '  so  defined  partially  orders  all  P  -measurable  decision  functions.  Suppose  that 
P  C.  F  and  i  a  za  P  -measurable  decision  function.  Since  d(P  )  6  D{P  ),  by  as¬ 
sumption  L{i)  <  L(d{P  )).  If  L(<)  —  )).  then  i  e  D{P  )  also,  and 

«(u;)  <'  dip  Kw)  for  all  w.  It  follows  that  )  maximitcs  {S:  a{6)  6  F)  with  respect 
to  .  In  particular,  d(P  )  maxiinites  ^  ^  E)-  Thus,  Pro¬ 

position  1  implies  that  d  satisfies  the  agreement  condition  for  rings. 


li 
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Proof  of  Proposition  3 

Assume  that  the  condition 

«T(<f(P,))  C  F;  C  f,  d(F,)=d(F,)  (E.l) 

is  true.  Then  ^<f(Fi))  V  <7(</{Fj))  C  Ft  H  F;  implies  oi</{Fi))  C  F|  n  F;  C  Fi 

which  in  turn  implies  if(F|)  =*  d(Ft  n  Fj)*  Likewise  ^(£2)  =  d(Ft  O  Fj). 

Hence,  the  condition 

<^d{Ft))  V  C  F.  n  F,  -=^  d(Ft)  “  d(Fi) 

is  true. 

Conversely,  assume  that 

cr{(/(Fi))  V  a{d[F ,))  C  Fj  D  F^  '■  ^  d{Fi)  =  d(Fj)  {E.2) 

is  true.  Then  Fj  C  F,  implies  Fj  «=  F,  fl  Fj.  Hence,  <K<i(F,))  C  Fj  implies 

i))  V  <><</(£?))  CF2=>F,  nFj.  Because  of  (E.2)  it  follows  that 

d(Fi)  =  d(Fj).  Thus,  the  condition 

<7(</(F,))  C  Fj  C  F,  d(F,)^d(Fj) 
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Proof  of  Theorem  4 

The  proof  of  Theorem  4  proceeds  in  various  steps:  First  we  describe  precisely  ihc 
evolution  of  the  estimation  pr'^cess  according  to  each  team  member’s  model,  and  deter-' 
mine  what  each  member's  model  predicts-.  Then,  compare  these  predictions  with  what 
happens  in  reality.  Both  Alpha  and  Beta  can  describe  the  evolution  of  the  estimation 
process  according  to  their  own  view  of  the  world  as  follows:  Let  a/  and  be  the  esti¬ 
mates  of  Alpha  and  Bela  at  time  t  according  to  Ts  perception,  (i  *=  Alpha,  Beta). 
Then, 

e'.flj,  ...,e/)  (F.2) 

where  i'  denotes  that  the  estimates  are  formed  according  to  the  rule  i  and  the  proba¬ 
bility  mea-s'ive  p'  induced  by  the  distribution  P‘  on  fJ.  Equations  (F.l)  and  (F.2)  con¬ 
sidered  for  all  t  and  for  all  w  €  fi  describe  the  evolution  of  the  estimation  process  ac¬ 
cording  to  member  I's  view  of  the  world.  To  determine  what  Alpha  .•>ad  Beta  predict 
about  the  outcome  of  the  estimation  process  in  terms  of  thc’ir  own  mocJei-«  consider  the 
following  o-fieids 

(i  =  Alpha,  Beta)  (F.3) 

—  <t{B,  a  \  ,  a  2,  ,  0(*) 

The  o-fields  F,'^  describe  the  view  of  member  i’  about  the  information  available  to 


Alpha  and  Beta  after  the  initial  observations  have  been  taken  and  t  tentative  decisions 
have  been  exchanged.  The  ir-rields  F,'^  and  evolve  dynamically  as  follows: 
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))  c  A  *"  n C  r  ’ 
ao<l  ))  c  nr'"  C  F'^ 

Silirc  d  M.itisfirs  llic  aprcmml  «on(!itions,  (F.I2)  itnplir^  that 

J’lF'*  n  F'”)^  d‘{r”)  .  IF  13) 

Thus,  both  Alyht  aod  Beta  predict  that  the  estimates  «ill  converge  and  agree  after  a  fin-, 

itc  number  of  steps. 

lo  reality,  tbe  following  is  happening:  At  time  t  >=  I  Alpka't  estimate  is 

J,  »  d\A)  (F.U) 

(where  A  is  Alpha'*  observaticn).  The  message  «|  is  transmitted  to  Beta.  Beta  inter-. 

prets  this  message  according  lo  his  own  view  of  the  world,  i.e.,  be  considers  that  tbe 


realisation  A  of  A  is  such  that 

J,  =  af  =  d*  (A  )  . 

Furllicrmorc,  for  a  coniistcat  iaterpretatioa  of  the  data  it  is  required  that 


P®(«,  «  «f)  >  0  . 


At  <  «=  2  Beta’*  e*timate  i* 


{F.15) 


(F.IO) 


^1  =  </-(».?,)  (F.17) 

(where  B  is  Beta's  observation),  and  this  estimate  is  transmitted  to  Alpha  who  interprets 

it  in  terms  of  his  own  model,  i.e.,  he  considers  that  the  realiiation  B  of  B  is  such  that 


h  =  «  </'  {B.  I,) 

For  a  consistent  interpretation  of  the  data  it  is  required  that 


{F.I8) 


P*  {Px*  =  !,)>  0  .  (F.19) 

In  general,  when  Alpha  receives  message  he  interprets  it  in  terms  of  his  own  view  of 

the  world,  i.e., 

=  rf'{A,  ■/?,.  ^,,  .  .  .  Jk  )  (F.20) 

which  be  sends  to  Beta.  For  a  consistent  interpretation  of  all  the  messages  received  by 


Alpha  and  lleta,  it  i)  required  that  si  soy  time  ( 

/•^(V  =  *^<  .  »  <  /  <  0  >  0 

(K  ”) 

sod  r"!*,*  -  s,  .  i  <  /  <  I)  >  0 

The  follouing  result  about  the  evolutioo  of  the  probabilities  of  (F.22)  is  true. 

Proposition  F.l. 

After  a  finite  number  of  steps  «  *  ,  either 

1  <  /  <  ^'^  )  =.0  (F.23) 

or 

Moreover,  for  all  s  >  «  ^ 

I  <  /  <  *)  “  ,  !</<*'')  .  (F-25) 

Similar  results  hold  for  P^{a(^  ~  •  I  ^  ^  ^ 

Proof 

The  result  follows  directly  from  the  fact  that  convergence  and  agreement  are  predicted 
to  occur  in  a  finite  number  of  steps  by  both  models.  The  time  is  given  by  (F.l). 

B 

Based  on  the  proposition  above  we  can  complete  the  proof  of  Theorem  4  as  follows; 

If 

1  </<0>o 

pS(a/  =  ?,  1  <  /  <  0  >  0 

are  true  for  all  t  <  a*  ,  respectively,  and  (F.24)  is  true  for  both  P^{-)  and  P®(-). 
then  because  of  (F.13j  and  the  rules  by  which  the  messages  arc  interpreted 

=.  =  d^(FP^  -=  d^(F,^^) 

for  ,'f  I  >  max  ,  s  ®  )  and  the  estimates  of  Atphe  and  Beta  agree  asympto!  ^  'f 


oo  ibf  othrr  hanti  (F.2.'l,)  ifur  at  vxnc  iiror  I  for  either  Alpha  or  lirla,  then,  at 
time  Alpha  or  litta  realire  that  the  »ri]t>rtire  ot  rereived  i”!  or 

rcaaooahly.  Alpha  or  //eta  rouit  eonriudr  that  the  t»o  moileli  ^  and  are  ii 


a/. 

til  It 

iii'-ir'' 

lif  oil- 
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ABSTRACT 


In  this  paper,  we  develop  additional  results  on  the  problem  of  reaching  a  consensus  of  opinion 
between  two  decisionmakers  provided  with  different  information.  Specifically,  we  study  the  problem 
where  the  tsvo  decisionmakers  may  have  different  underlying  probability  models.  We  develop  results 
characterizing  the  likelihood  of  a  consensus  being  reached  in  terms  of  the  nature  of  the 
inter-decisionmaker  communications.  We  also  study  the  problem  when  the  decisionmakers  are  aware 
of  the  possibility  that  they  may  have  different  models.  In  this  case,  the  decisionmakers  can  reach  a 
deadlock  state  where  neither  decisionmaker  can  learn  additional  information  from  the  consensus 
process,  and  they  cannot  reach  a  consensus  decision.  This  surprising  result  indicates  that 
incorporating  human  uncertainty  in  probability  assessment  into  the  consensus  problem  can  lead  to 
outcomes  not  anticipated  in  the  general  theory  developed  in  refs.  [1]  -  [7]. 
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1.  INTRODUCTION 


The  general  problem  of  reaching  a  concensus  of  opinion  among  several  decisionmakers 
provided  with  different  information  has  received  considerable  attention  in  the  recent  literature  ( 1 117). 
The  consensus  problem  consists  of  finding  a  decision,  which,  to  each  decisionmaker,  is  the  correct 
decision  according  to  a  specific  decision  rale,  given  his  information.  Decisionmakers  approach  a 
consensus  by  exchanging  tentative  decisions  among  themselves,  thus  exchanging  part  of  their 
information. 


In  [l]-[7],  a  Bayesian  framework  was  developed  for  analyzing  the  consensus  problem.  Under 
the  conditions  that  all  decisionmakers  share  a  common  prior  probability  model,  Aumann[  1],  Borkar 
and  Varaiya  [2],  Tsitsiklis  and  Athans  [3],  Geanakoplos  and  Polemarchakis  [4]  and  Washburn  and 
Trneketzis  [5]  showed  that  decisionmakers  would  approach  a  consensus  under  mild  regularity 
conditions  on  the  communication  pattern. 

In  subsequent  papers  [6][7],  Teneketzis  and  Varaiya  showed  that  relaxing  the  condition  that  all 
decisionmakers  share  a  common  probability  model  could  lead  to  eventual  disagreement  Specifically, 
they  showed  that  when  llie  fact  tliat  each  decisionmaker's  probability  model  can  be  different  is  secret 
knowledge  [12]  (not  available  to  any  decisionmaker),  the  consensus  process  can  reach  a  state  of 
contradiction,  thereby  revealing  that  the  underlying  probability  models  were  different 

In  this  paper,  we  examine  in  greater  depth  some  of  the  issues  raised  by  the  results  of 
Teneketzis  and  Varaiya  [6],(7].  We  limit  our  study  to  the  case  of  two  decisionmakers  involved  in  the 
consensus  problem.  First  we  study  the  question  of  how  likely  arc  the  agreement  or  disagreement 
outcomes  when  the  two  decisionmakers  have  secret  probability  models.  Then,  we  study  the 
consensus  problem  where  each  decisionmaker  can  have  multiple  probability  models,  so  that 
knowledge  that  the  underlying  probability  models  can  be  different  is  common  knowledge.  Based  on 
this  assumption,  we  develop  a  new  Bayesian  formulation  of  the  consensus  problem  which  is  similar 
to  the  Bayesian  formulation  for  games  of  incomplete  information  [8].  Using  this  formulation,  we 
show  that  the  general  frameworks  of  Washburn  and  Teneketzis  |5]  and  Teneketzis  and  Varaiya  [7]  can 
be  extended  to  study  issues  of  convergence  and  agreement  in  this  problem. 
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framework  which  is  used  to  study  the  consensus  problem.  In  section  3,  we  develop  additional  results 
on  the  problem  studied  by  Tencketzis  and  Varaiya  (6),  I?).  In  section  4,  we  discuss  the  consensus 
problem  with  multiple  models.  Section  5  contains  a  discussion  of  the  results. 

2.  PROBLEM  FORMULATION 

Throughout  this  paper,  we  will  use  the  following  stochastic  decisionmaking  model: 

Let  (Ci,F)  denote  a  measurable  space,  with  F  denoting  the  o-ficid  of  measurab'e  events*.  Let 
{P‘»  i  cl}  denote  a  family  of  probability  measures  on  this  measure  space.  The  set  I  is  assumed  to  be  a 
discrete  set,  finite  or  countable,  with  the  discrete  topology.  The  measure  space  (O,  F )  represents  the 
uncertainty  present  in  the  decision  problem. 

There  are  two  decisionmakers  (DM)  in  the  consensus  problem.  DM  1  (2)  has  a  personal 
probability  model  (£2,  F ,  P‘l)  ( (£2,  F ,  P’2)),  where  ij  and  12  are  selected  from  the  index  set  I.  In 
addition,  each  DM  has  a  probability  distribution  over  I,  representing  his  beliefs  that  the  other  DM  is 
using  a  particular  model,  as  follows: 

Denote  by  I  x  I  the  event  space  of  all  possible  combinations  of  models  for  the  two  DMs,  with 
the  product  o-field  2^  x  2^.  Let  Pj,  P2  denote  probability  measures  on  this  space.  DM  I's  (2's)  initial 
private  information  concerning  the  pair  of  probability  measures  {  P'l,  P’2}  used  by  the  DM’s  is 
represented  by  the  a-field  Hj  (H2),  where  Hj  is  generated  by  atoms  h*j  of  the  form 

h*j=  {ij}xKj  ,  where  1 3  Kj  (1) 

Similarly,  H2  is  generated  by  atoms  h^j  of  the  form 

h^j  =  Kjx{i2},  where  1 3  Kj  (2) 

Eqs.  1  and  2  mean  that  each  DM  is  provided  with  private  information  concerning  his  own  probability 
mrdel,  plus  the  information  that  the  other  agent’s  model  belongs  to  a  specific  set  of  models. 


.•.’4 
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In  addition  to  private  information  concerning  the  pcrvmal  probability  models  used  by  each 
DM.  each  DM  receives  private  information  concerning  the  true  event  which  occurs  >n  the  measurable 


•  The  reader  unfamiliar  with  the  concepts  of  measure  theory  should  refer  to  f  10). 
space  (ii,  F  ).  This  information  is  represented  by  finite-valued  F  -measurable  functions 

y‘:  Q - »  Y>  (3) 

Let  y  *  in  F  denote  the  o-field  induced  by  y*.  Then,  y  ‘  is  a  finite  o-ncld 

We  can  combine  the  measurable  spaces  (Q,  F )  and  (1  x  1. 2^  x  2^)  to  form  the  product  space 
(Q  X  I  X  !,  r  X  2^  X  2^.  On  this  space,  define  the  measures  Oj,  fli  follows:  Let  ^  z  F  .  Then, 

ni(F,ij,i2)  =  Pi(ij,i2)P'l(F)  (4) 

n2(F;i,.i2)  =  P2(»i>>2)P'2CF)  (5) 

n,.  02  are  probability  measutes  on  (D  x  I  x  I,  F  x  2^  x  2^)  because  P*  are  probability  measures  for 
each  i. 

For  two  o-fields  A  ,3  ,  define  A  V  fl  to  be  the  smallest  a-field  containing  both  A  and  B  . 
Similarly,  define  A  AB  to  be  the  largest  o-field  coritained  in  both  A  and  B  .  Define  Qj,  Q2  to  be 
the  restrictions  of  nj,n2  to  the  o-field 

F  =  {yj  vy2}  X  {H,  VH2}.  (6) 

The  decision  rule  used  by  each  decisiomaker  is  a  rnap  from  o-fields  A  into  decision  functions, 
which  depends  on  the  probabilities  Qj  as  in  [5).  We  assume  that  the  decision  niies  are  F  - 
measurable.  For  the  problems  considered  in  this  paper,  the  decision  rules  wiil  be  of  the  form 

djfF)  f  arg  min  C^i  {  u)  i  F  }  '  '  (7) 
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ucU 


(i2(ri  e  arg  min  E^2  {  J((0,  u)  |  F  }  (8) 

u  c  U 

for  F  £  l_  where  U  is  ihc  space  of  allowable  decisions,  and  the  subscript  i  is  used  to  denote  the 
expjcctat  1  according  to  the  probability  distribution  Qj.  Note  that  multiple  solutions  to  eqs.  7  and  8 
can  occur.  Usually,  there  will  be  a  tic-breaking  procedure  for  selecting  dj(F),  d2(F).  We  assume  that 
U  is  compact,  and  that 

X  U  - - » (0,«)  (9) 

is  a  continuous  function  of  of  u  for  each  w  £  Cl.  As  in  [51,  let  a(dj(/4  ) )  be  the  CT-fidcl  generated  by  the 
decision  rule  d  under  probability  YFli  when  the  available  information  a-field  is  A  . 

The  consensus  process  can  now  be  described.  Each  DM  receives  initially  one  measurement  yj. 
Based  on  this  measurement  and  his  probability  model,  each  DM  computes  a  tentative  decision 
according  to  a  decision  rale  dand  communicates  it  fo  the  other  DM.  Then,  each  DM  sequentially 
interprets  the  other  DM’s  decision,  revises  his  own  decision  due  to  the  acquired  knowledge,  and 
communicates  his  new  decision  to  the  other  DM.  This  process  creates  a  sequence  of  information 
o-fields  F  j(n),  F  2(0)  evolving  in  the  lattice  of  sub-o-fields  of  E.-  With  this  process,  a  sequence  of 
decisions  is  generated. 

Let  U|(n)  (ujfn))  denote  the  value  of  the  nth  communication  of  DM  1  (DM  2),  selected  as  a 
function  of  the  information  available  to  him  according  to  his  decision  rule.  For  (to,ij,i2)  in  Q  x  I  x  I, 
we  say  that  the  DMs  reach  a  consensus  (agreement)  at  f&).i^.i2')  if  and  only  if 

lim  Uj(n)  =  lim  U2(n)  (10) 

n— >00  n— >'» 

The  above  framework  includes  the  formulations  of  [1]  -17)  as  special  cases.  In  [  1 1-[51,  the 
common  probability  mcxlel  formulation  can  be  captured  in  the  above  framework  by  letting  the  set  1  be 
the  singleton  set  {If.  In  this  case,  the  probabilities  P],P2  ate  trivial,  and  the  remaining  probabilistic 
framework  corresprrnds  to  the  general  framework  presented  in  |51.  The  formulation  of  10]  can  he 
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captured  by  selecting  Pj,  P2  to  be  purely  diagonal  measures,  of  the  form 

P —  0  ==  M ^*2* 

and  the  selection  of  i,.  h  is  such  that  i,  vs  ij.  In  this  case,  each  decisiomaker  is  convinced  that  the 
other  decisionmaker  will  use  the  same  probability  model  as  he  does.  However,  the  initial  models 
selected  for  each  player  may  be  different. 

3.  GENERICITY  AND  CONTINUITY  OF  CONSENSUS:  THE  SECRET  MODEL  PROBLEM 

In  this  section,  we  analyze  the  model  of  (6)  to  determine  how  likely  are  agreement  or 
disagrcerpcnt  outcomes.  We  sep,irate  our  results  into  two  cases:  the  case  when  the  decision  variables 
are  continuous,  and  the  case  when  the  decision  variables  are  di.scrcte.  In  order  to  specialize  the 
formulation  of  section  2  to  the  problems  investigated  in  (6]  and  [7],  we  make  tlie  following 
assumptions: 

Al.  There  exists  some  A  e  {K  ‘  V  such  that 

PM(A);^  P'2(A),  (12) 

A2.  The  beliefs  of  each  DM  concerning  the  other  DM's  probability  model,  Pj  and  P^,  satisfy  eq. 

11. 

Assumption  Al  guarantees  that  the  differences  in  the  DM’s  models  are  detectable  with  the  available 
observation.  Assumption  A2  specifies  that  the  knowledge  that  the  models  may  be  different  is  secret 
knowledge  to  each  DM.  '  ■  ' 

3.1  Continuous  decision  variables 

When  the  decision  space  U  is  a  continuous  space,  we  make  the  following  additional  assumption 
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A3.  U  is  a  convex  subset  of  R",  and  J(u.u)  is  a  strictly  convex,  differentiable  function  of  u  for 
each  0). 

Assumption  A3  guarantees  that  there  exist  unique  solutions  to  eqs.  7  and  8.  Note  that  A1  -  A3  are 
satisfied  by  the  model  in  [6],  since  the  decisiomakers  exchange  the  conditional  probability  of  an  event 
(X)  occurring  given  their  information.  In  this  case, 

J((o,u)  =  (I{  o>  e  X  }  -  u)^,  (13) 

so  it  satisfies  A3.  The  assumptions  in  [6]  concerning  the  different  probability  models  of  the 
decisionmakers  correspond  to  A1  and  A2. 

In  order  to  characterize  the  likelihood  of  agreement  or  disagreement  results,  we  need  the 
following  definitions:  Let  11  be  the  space  of  all  probability  distributions  op  (Q,  {/  *  V  Since 

{y  1  V  y  2}  is  a  finite  o-field,  11  is  a  simplex  in  R”,  where  n  is  the  cardinality  of  the  atoms  of 
{Y  *  V  y  Alternatively,  11  can  be  viewed  as  a  subset  of  with  positive  Lebesgue  measure. 

Definition:  A  result  is  said  to  be  generic  in  0  if  and  only  if  the  set  {FI  £  Til  result  is  not  true  for  H} 
has  zero  n-l-din.ensional  Lebesgue  measure. 

Proposition  1:  Under  assumptions  A1-A3,  if  for  all  B  o  A,  either 

i.  Uj(0))=  arg  min  Epi({J(o),u)!  B}  eU®,  or 

ueU 

ii.  U2(co)  =  arg  m.in  Epi2{J(co,u)l  B}  eU° 

ueU 

where  U'^  is  the  interior  of  U,  it  is  generic  that,  for  some  to  e  Q,  a  contradiction  will  be  reached  in  the 
process  of  consensus. 

Proof:  Without  loss  of  generality,  assume  condition  i  holds.  Consider  any  instance  when  a  tentative 
decision  is  sent  from  DM  1  to  DM  2.  Denote  that  decision  as  u,  ,  and  the  information  o-field  available 
to  player  I  as  F  j.  Then.  Uj  is  an  F  j  -  measurable  random  variable  satisfying 
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Uj(co)  =  arg  min  Epij  {  J(o,u)  1  F  j}. 
ueU 

In  order  for  a  contradictic.i  not  to  occur,  DM  2  must  be  able  to  interpret  U|(0))  in  terms  of  i.'s  own 
probability  model;  that  is, 

U2)(cd)  =  arg  min  Epi2  {  J(ci),u)  \F2x},  (15) 

ueU 

where  F  2\  is  DM  2’s  perception  of  the  information  available  to  DM  1 .  Note  that  F  2  j  ts  a  coarser 
a-field  than  {K  *  V  y  hence  it  is  also  a  finite  field.  Let  U2J  denote  the  following  subset  of  U: 

U2J  =  {u  eU  I  u  =  arg  min  Epi2  {J(to,u)|B}  for  some  B  e  F21,  for  some  o-field  F  2] 

satisfying  {y'  V  K^}  3^2, 

Note  that  U2J  is  a  finite  set,  since  each  o  field  F  2j  is  finite  and  there  are  only  a  finite  number  of 
o-fields  satisfying  the  inclusion  conditions.  A  necessary  condition  for  the  consensus  process  not  to 
reach  a  contradiction  is  Uj((0)  e  U21  for  all  cue  D. 

Let  fj  denote  the  atoms  of  {K  *  V  Y^].  Let  F,(OJ)  denote  the  atom  of  F  j  confaini  w  o),  and 
f(0))  denote  the  atom  of  { K  ’•  V  y  2^  containing  ox  Select  co  e  A  such  that  PL(f(co))  P'2(f(co)).  Such 
3.  03  exists  by  3SsurTjption  A2  Then, 

Ui(co)=  argmin{Epi,{  Epi.,(J(co,u)l{ri  Vy2}}|F,} 
ueU 

j  J(v,u)  P^’  (dv) 

=  argminE  i,  [  - |  F^lo;)  } 

^  «  U  j  p’t 

V€f(w) 

Define  J(f,.u)  as 


Q 


O' 


Then, 


j  J(v,u)  (dv) 
j(f,,u)  =  ygfj  _ 

P^Nfj) 


2  p ’(f,) 

Uj(ci))  =  argmin  {'  _ _ 

P‘’(F, ((!>)) 


)  (17) 


By  assumption,  the  minimizing  value  is  in  the  interior  of  U;  a  necessary  aixl  sufficient  condition 
characterizing  U)(Ci))  is 

S  PM(fi)  2Ll(fi.Ui«o))  =  0.  (18) 

F,(CD)afi  au 

Since  Uj(o))  must  belong  to  U2],  this  means,  for  some  a  e  U2],  A  c  P*  2I’ 

1  P‘2(fi)  2.  l(fi.a)  =  0.  (19) 

Aafj  au 

For  each  o,  A  the  set  of  P*2  in  11  satisfying  eq.  19  has  n-1  dimensional  Lebesgue  measure  0,  since  eq. 
19  imposes  a  linear  constraint  on  P‘2.  Since  there  is  a  finite  number  of  a  in  U21  and  A  in  F  2j,  the 
set  of  P‘2  in  li  satisfying  eq,  19  for  some  ct,  A  also  has  n-1  dimensional  Lebesgue  measure  0,  which 
implies  that  a  contradiction  is  generic  for  some  to  in  A.  q.e.d. 

The  results  of  proposition  1  can  be  understood  in  terms  of  the  example  below: 

Example  1:  Let  fi  =  (0,2)  x  [0,3 j,  and  let  F  denote  the  Borel  sets  in  Let  the  a-field  Y  ’  be 
defmed  by  the  atoms  {(ofto  E  a^ },  {toiti)ea2},  and  the  o-field  K  ^  be  defined  by  the  atoms  {o)|a)  e  b, }, 


{o)|(oeb2}  and  {(o|coeb3},  where  aj,  bj  are  defined  in  figure  1.  Define  probability  models  P^  as: 

P^(A)  »  p(A}/6 ,  where  p  is  two-dimensional  Lebesgue  measure 

p2(A)  -  p(Anb,)/36  +  p(Anb2)/24  +  7n(Anb3)/72. 

P*  is  unifonnly  distributed  over  D,  while  P^  is  uniformly  distributed  conditioned  on  b^,  but  has 
p2(b,)«l/6;  p2(b2)»  l/4;  p2(b3)-7/12. 

Let  i|  -  1,  while  12  >  2«  so  that  DM  1  uses  probability  model  P^  while  DM  2  uses  jxobabili^  model 
P^.  The  decision  rule  used  by  the  DM's  is  defined  by  eq.  13,  where  the  event  is  event  X  in  figure  1. 
As  noted  in  [6],  when  <■>  e  a^  n  53,  and  1  communicates  first,  there  is  eventual  agreement, 
although  the  DMs  have  very  different  reasons  for  reaching  that  agreement  hi  order  to  show  that 
disagreement  is  generic,  we  will  show  that  arbitrarily  small  perturbations  to  P^  will  result  in 
disagreement  Specifically,  let 

P(b,)  -  (2  +  e,yi2;  Pfbj)-  O+ejVn ;  P0>3)  -  (7-e, -62X12. 

As  in  [6],  DMl's  fust  communication  is  U}  «  In  <»der  for  this  value  not  to  be  a  contradiction, 
either  , 

3  Ej  +  ^£2  =0  {20a) 

or 

€j+2€2«4.  (20b) 

The  two-dimensional  Lebesgue  measure  of  the  set  of  all  E|,  62  satisfying  eqs.  20  a  or  b  is  0,  since  it  is 
the  union  of  two  lines.  Hence,  for  almost  all  choices  of  £],  £2,  a  contradiction  will  be  reached  in  the 
first  communication. 

The  reason  for  the  gcncricity  of  disagreement  in  proposition  1  is  that,  although  ek:h  DM  can 
observe  wily  a  finite  number  of  observation  values,  he  can  communicate  a  continuous  number  of 
decisions.  This  enables  the  other  DM  to  detect  differences  in  the  probability  models.  Conditions  i  or  ii 
in  proposition  1  guarantee  that  the  announced  decisions  will  vary  with  small  differences  in  probability 
models. 


3.2  Discrete  Decision  Variables 

In  this  subsection,  we  assume  that  the  space  U  is  discrete.  Let  fj  denote  an  atom  of  {Y  Wy 
We  define  a  metric  on  0  ^  follows: 

ForP*,p2en, 


d(P*4>2)  .  niax  |P>(fi)-p2(fi)|. 

fi 


This  metric  is  equivalent  to  the  Euclidean  metric  on  11. 


Definition:  An  agreement  or  disagreement  result  is  said  to  be  continuous  in  O  at  P'l,  P‘2  if 
agreement  or  disagreement  continues  to  hold  for  all  P^  P^  in  a  neighborhood  of  P‘1,  P‘2. 

Assume  in  addition: 


A4.  For  any  A  e  {K  *  V  F  exists  unique  Uj,  U2  in  U  such  that 


Uj  =  arg  min  Eptj  {J(a),u)  |  (U  £  A} 
ueU 


U2  =  arg  min  Epi2  {J(o),u)  |  (O  E  A} 
ueU 


With  this  assumption,  we  have  the  following  characterization  of  agreement  or  disagreement  outcomes: 

Proposition  2.  Under  assumptions  Al,  A2  and  A4,  if  agreement  occurs  for  P'l  J*‘2,  it  is  continuous  in 
n.  If  disagreement  occurs  for  P'l,  P‘2,  it  is  continuous  in  n. 

Proof:  Without  loss  of  generality,  assume  that  P'l,  P‘2  result  in  agreement  Denote  by  F  ,(n)  {F  2(n)) 
DM  I’s  (  DM  2's )  sequence  of  a-ficlds  generated  in  the  consensus  process.  Each  one  of  these  fields 
is  coarser  than  { F  '  V  F  2},  hence  finite.  For  any  time  interval  n,  the  atoms  of  F  j(n)  and  F  2(n)  are 


elements  of  {y  *  V  K  For  any  A  e  {F  W  F  define  the  function 


2  p^’(fj)J(vu) 

L(P^»,  A  )  =  - -  }  (22) 

P*‘(  A  ) 


This  is  a  continuous  function  in  II-  Because  of  asumpdon  A4,  we  can  find  a  5}(A)  such  that,  for 
dd-l.Pb  S  5,(A). 


2  p‘(fpi(fi.u) 


argmin  f 
u  £  U 


.  f,  c  A 


P’(  A  ) 


2  p’(f,) 


argmin  f 
u  €  U  ‘ 


.  f ,  c  A 


P’(  A  > 


A  similar  result  can  be  established  in  terms  of  52(A)  for  DM  2's  decisions.  Select  a  as 

5  =  min  {  6i(A),52(A) }.  (23) 

{y,vy2}=A 

This  minimum  exists  because  there  are  only  a  finite  number  of  A  in  {T  *  V  y  2).  This  choice  of  5 
guarantees  that  the  exchanged  sequence  of  decisions  and  the  o- fields  inferred  by  the  other  DM  are  the 
sanie  for  all  probabilities  satisfying 

.  d(PM,P*)  S  5 
d(P’2,  p2)  <5. 


, 


.  •  .V.  • 


thereby  completing  the  jKOof.  q.e.d. 


The  result  of  proposuion  2  depends  critically  on  assumption  A4.  However,  an  argument 
similar  to  the  prot^  of  {nopos.tion  1  establishes  the  following  result 

Proposition  3.  If.  for  each  atom  fj  e  f  T 

l(fi,u)itl(fi,v)  if  u?tv, 

then  Assumption  A4  is  generic  in  n< 

Proof:  If  assumption  A4  does  not  hold  for  PM,  there  must  exist  a  set  A  £  {T  *  V  y  and  u,  v  e  U, 
u^,  such  that 

Epij  {J((ii),u)  1 0)  e  A}  «  Epi|  {J(o>,v)  |  a>  e  A}. 

This  implies  that 

2  p*’ (f,)  j(fi,u)  s  2  p’’ (fi)  iKfi.v) 

f,cA  •  f,CA 

Since  Kfj.v)  ^  KfpU)  for  any  atom  of  {T  ‘  V  y  2},  this  implies  that  the  set  of  PM  which  satisfy  this 
equation  has  Lebesgue  measure  0  in  H-  A  similar  argument  for  P*2  completes  the  proof,  q.e.d. 

Example  2:  Let  (£*/"  ),Y  |,  y  2.  swd  X  be  defined  as  in  example  1.  Define  probability  models  P^ 
P^  as: 


P^(A)  =  2p(Anaj)/15  +  p(Ar'a2)/5,  where  p  is 


two-dimensional  Lebesgue  measure. 


P^(A)  =  |i(Ana2)/5 +  44(Ar'ajnb|nX)/95  +  12ji(Anajnb|OX)+  12^(Ar>b2r^a|)/95  + 
3(5^(AnaJnb3r^X)/95  +  12n(Aoainb3rtX)/95 

where  X  is  the  complement  of  X  in  £2.  Rather  than  work  with  the  unconditional  probabilities,  what  is 
important  is  to  evaluate  the  conditional  probabilities  of  events  given  available  infnmation.  Thus, 
is  uniformly  distributed  conditioned  on  a^,  with 
p3(a,)^-.4:  p3(a2)  =  .6. 

P^  has  the  same  distribution  as  P^  on  32,  but  differs  on  aj,  as 


P^(X|a,nbi)  =  .25;p3(Xlainbi)=.5  (24a) 

p4(XIainb2)  =  p3(X|ainb2)  =  .75  (24b) 

P^(X|ainb3)  =  JO ;  p3(XIainb3)  =  .25  (24c) 

p4(X|ain(b2ub3))  =  .6;  p3(X|ain(b2ub3))=  J  (24d) 

P^(X|ai)  =  10/19  ;  P^fXIaj)  =  J  (24e) 


Let  U  =  {0,1}.  Let 

J((i),0)  =  J3  if  to  e  X, 

=  0ifa)e2i. 

J(©,1)  =0if(DeX 

=  .47if(aeVX. 

With  this  defuiition  of  J,  the  optimal  decision  for  DM  1  given  an  information  set  A  is  given  by 

uj  =  1  if  P‘j(a)eX| A)  >  .47  (25) 

=  0  otherwise. 

The  same  decision  rule  is  optimal  for  DM  2,  using  the  probability  P'2. 

Assume  i  j  =3,  while  i2  =  4,  that  DM  1  exchanges  his  decision  first,  and  that  the  DM's  alternate  in 
exchanging  decisions.  As  in  example  1,  assume  that  toe  aj  n  b3.  From  eqs.  24e  and  25.  DM  I’s 
initial  decision  is  1 .  That  is,  u  j  ( 1 )  =  1 .  If  DM  1  's  information  had  been  82,  his  decision  would  have 
been  U|  =  0.  Hence,  according  to  DM  1,  he  has  signalled  toe  aj  to  DM  2. 


s’." 
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According  to  DM  2’s  probability  model,  uj(l)  =  1  implies  (oe  aj.  Hcncc,  DM  2  believes  (oe  a]  r> 
b3.  His  optimal  decision  is  U2(l)  =1.  because  of  eq.  24c.  According  to  DM  2,  his  decision  has 
signalled  O)  e  i>2  ^  t>ccause  of  eqs.  24a  and  24b. 

Because  of  eqs.  24  a,b,c,  DM  1  inteipreis  U2(l)  =«  1  to  mean  a>  e  a]  n  (b|Ub2).  His  optimal  decision 
isu](2)=  1.  Hence,  a  consensus  has  been  reached  at  u  2(2)  =  U2(2)=  1.  However,  they  have 
reached  this  agreement  for  the  wrong  reasons,  since  DM  1  believes  co  e  a2  n  (b|Ub2),  whereas  in 
actuality,  OOE  a2  o  b3!  Note  that  any  changes  in  either  orP^  which  would  change  the  numbers  in 
eqs.  24  a  e  by  less  than  .02  would  continue  to  result  in  agreement 

Suppose  that  the  order  of  communication  is  reversed,  so  that  DM  2  communicates  flrst  The  optimal 
decision  U2(l)  =  1-  Note  that  P^(X  |  (O  E  b2)  <  .45,  P^(X|a)  E  b2)  <  .2 .  Hence,  DM  2  believes  he 
has  signaled  (0  E  1)3. 

According  to  DM  1,  he  interprets  U2(l)  =  1  to  mean  to  E  b3.  Hence,  he  believes  0)  E  a2  O  b3.  His 
optimal  decision,  according  to  eq.  24c,  is  U2(l) »  0.  This  decision  cannot  be  understood  by  DM  2, 
because  he  expected  uifl)  »  1  whether  DM  1  knew  aj  or  32.  Ifence,  the  DMs  have  reached  a 
contradiction.  Note  that  this  contradiction  will  be  reached  even  if  P‘2  or  P'2  arc  modified  by  .02. 
Hence,  the  disagreement  outcome  is  also  continuous. 


The  above  results  illustrate  that,  when  the  decision  S2>aces  are  discrete,  small  discrej^ncies  in 
the  probability  riKxlels  of  the  decisionmakers  will  not  affect  the  consensus  process.  They  also  show 
that  the  set  of  pairs  of  probability  models  for  which  cemsensus  occurs  has  2X>sitive  Lebesgue  measure 
in  n,  unlike  the  result  in  the  continuous  decision  case  of  the  previous  section.  However,  the  set  of 
pairs  of  prrobability  models  for  which  contradictions  occur  also  has  positive  Lebesgue  measure. 
Hence,  contradictions  are  common  [rficnomena  in  the  consensus  process. 

The  question  still  remains;  How  does  the  consensus  process  proceed  once  a  contradiction  is 
encountered?  Such  a  contradiction  reveals  that  the  basic  assumption  that  P]  (i  2 ,i2)  =  P2(i  j  ,i2)  =  0  if 
ii^i2,  is  violated.  In  the  next  section,  we  present  a  plausible  model  for  this  process,  and  study  its 
implications. 


4. 


CONSENSUS  PROBLEMS  WITH  MUL'FIPLE  PROBABILITY  MODELS 

When  the  decisionmakers  in  the  conso^'us  process  have  different  subjective  views  of  the 
world,  ahd  Uiese  differences  are  secret  knowledge,  the  results  of  [6]-(7]  show  that  a  contradiction 
outcoms  is  often  reached  whereby  the  existence  of  these  differences  becomes  common  knowledge.  At 
this  point,  our  model  of  how  the  consensus  process  proceeds  is  that  each  decisionmaker  models 
statistically  the  types  of  probability  models  which  the  other  decisionmaker  may  be  employing,  and 
acquires  information  through  the  consensus  process  concerning  the  possible  models  used  by  the  other 
I^,  and  the  uncertainqr  in  the  event  space  CL  Within  this  framewMk,  we  investigate  convergerKe  and 
agreement  issues  fcx*  two  cases:  When  the  statistics  of  the  types  probability  models  are  common 
knowledge,  and  when  these  statistics  are  seoet  knowledge.  The  analysis  is  based  on  the  mathematical 
formuladon  developed  in  section  2,  where  the  probability  distributions  P],  P2  represent  the  statistics 
used  by  each  DM. , 

We  make  the  following  assumptions: 

A5:  P| .  P2  are  common  knowledge. 

A6:  The  decision  rule  d  satisfies  the  agreement  condition 

Recall  that  Pj  J*2  ^  subjective  probabilities  on  the  space  of  possible  model  pairs  (i|,  i2). 
Assumption  A5  implies  that  the  subjective  statistical  distribution  of  possible  probability  models  for 
each  decisionmaker  is  known  to  the  other  decisionmaker,  and  this  fact  is  common  information.  Note 
that  we  do  not  assume  that  these  distributions  are  equal.  This  allows  DM  1  to  believe  he  has  a 
different  range  of  possible  decision  models  than  DM  2  has,  and  viceversa. 

The  decision  rule  d  is  said  to  satisfy  the  agreement  condition  if ,  whenever  G  |  2  G  2  2 
o(d(G  j)),  then  d(G  j)  =  d(G  2).  The  agreement  condition  implies  that,  if  a  decision  is  based  on 
information  which  is  common  to  the  information  a-fields  G  2  and  G  j,  then  knowledge  of  either  G  2 
or  G  J  would  result  in  the  same  decision.  In  [5]  and  [7],  a  sufficient  condition  was  developed  to 
characterize  when  a  decision  rule  satisfies  the  agreement  condition.  When  specialized  to  our  model, 
this  condition  can  be  stated  as: 


Proposition  4:  Assume  that  there  is  a  total  order  <  on  U.  Let  D|(o>),  D2(ci))  denote  the  set  of  solutions 
of  eqs.  7, 8  for  eaich  ©.  If  U|(a)),  U2(0  are  selected  to  be  the  minimal  elements  in  Dj((d),  D2(o)) 
respectively,  then  the  decision  rules  dj,  d2  satisfy  the  agreement  condition. 

Proof:  See  [5],  [7]. 

Under  assumptions  A5  and  A6,  we  can  prove  the  following  result: 

Proposition  5:  Under  Assumptions  AS  and  A6,  if  ij  « 12  aixl  (D  e  is  such  that  the  consensus 
process  reveals  that  i  j  » i2  to  both  decisionmakers,  then  the  decisionmakers  reach  a  consensus  for  ox 

Proof:  Without  loss  of  generality,  let  ij  =»  i2  “  1.  The  consensus  process  starts  with  the  initial 
information  o-fields 


Gi(0)  =  ylxHi 

;26a) 

G  2(0)  =  Y^xH2 

(26b) 

After  each  communication  is  heard,  each  decisionmaker  learns  additional  information.  The  evolution 
of  information  of  each  decisionmaker  can  be  described  by  the  evolution  of  a  dynamical  system  in  the 
lattice  of  sub-<^ficlds  of  £1,  as  in  [5]: 

Gi(n+1)  =  Gi(n)  V  a(d2(G2(n)) 

(27  a) 

G2(n+1)  =  G2(n)  V  o(di(Gi(n)) 

(27  b) 

(or  G  i(n+l)  =  G  j(n)  V  a(d2(G2(n+l)) 

(27  c) 

G2(n+1)  =  G2(n)  V  <j(d,(G  i(n)), 

(27  d) 

depending  or.  whether  communications  are  simultaneous  or  staggered), 

where  djfG  )  is  the  decision  rule  of  eqs.  7  or  8  applied  to  the  atoms  of  the  rr-field  G  .  The  lattice 
operations  are  V  and  A,  where  A  W  B  represents  the  coarsest  o-field  containing  both  A  and  B  ,  and 


i4  AB  is  the  finest  o-field  co.itained  in  both  A  and  i? .  Because  of  A5,  the  evolutions  indicated  in  cq. 
27  are  common  knowledge.  Note  that  these  dynamical  systems  arc  evolving  on  a  lattice  of  finite 
fields,  and  that  they  generate  a  strictly  increasing  sequence  of  o-ficlds.  Hence,  after  some  finite  time  t, 
a  limit  must  be  reached  such  that,  for  all  s  >  t, 

G  i(s)  =  G  1  =  C  1  V  o(d2(G 2)) 

G2(s)  =G2=‘G2  V  o(di(G  1)).  (28b) 

Eqs.  28  establish  that  the  consensus  process  converges  to  a  limit;  that  is, 

lim  uj(n)  =  U|*;  lim  U2(n)  =  U2*.  (29) 

n— »oo  n-400 

In  addition,  eq.  28  implies  that 

Giao(d2(G2))  (30a) 

G2  20(di(Gi))  (30  b) 

Furthermue,  the  fields  generated  by  a  decision  rule  arc  cemtained  in  the  infOTmation  available  for 
decisions.  That  is, 

O2ZI  o(d2(G  2))  (31a) 

Gi2a(di(Gj))  (31b) 

Hence, 

G22  G2AGJ  a  o(d2(C  2))  (32  a) 


G  I  a  G  2  AG  j  a  <T(di(G  j)) 


(32  b) 


By  ^sumption,  the  decision  rules  satisfy  the  agreement  condition.  Hence,  by  eq.  32, 


di(G  j)  =  dj(G  j  A  G  2) 


d2(G  2)  =  ^2^^  1  ^  ^  2^ 


(33) 


Select  (D  e  Q  such  that  ii  »  1  =  i2  is  common  knowledge.  Since  the  o-fields  are  increasing,  there  is  an 
atom  g  containing  (O  in  G  ^  A  G  2  of  the  formg  ^  (A,l,l) ,  where  12  2  A.  Since  g  is  an  element  cf 
both  limiting  fields  G  j  and  G  o ,  it  follows  that 


djfg)  =  arg  min  eQi  {  J(o),u)|g} 
ueU 


(34) 

arg  min  {Pjfl,!)  Epl  {J(to,u)|o)e  A}}. 
ueU 


Similarly,  since  i2  =  1  in  g, 

d2(g)  =  arg  min  £^2  {  J(<D,u)|g} 
u  e  U 

(35) 

=  arg  min  {P2(l,l)  Epl  {J(o),u)tcoe  A}}. 

,  u  e  U 

Note  that  the  functions  being  minimized  are  simple  multiples  of  each  other.  Thus,  d2iLg)  =  d|(g)  for 
all  such  g.  Coupled  with  eq.  33,  this  completes  the  proof.q.e.d. 

According  to  the  above  proposition,  even  if  the  decisionmaI.ers  have  the  same  probability 
model,  consensus  is  not  guaranteed  unless  it  becomes  comon  knowledge  that  ij  =  i2.  On  the  other 
hand,  the  condition  that  co  e  O  is  such  that  the  consensus  process  reveals  that  i  j  =  i2  is  sufficient  but 
not  necessary  for  reaching  a  consensus.  Tl.ese  points  are  illustrated  by  the  following  example. 
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Example  3:  Let  (QJF ),  K  j,  K  2»  X,  P^,  P^,  U  and  J(0),u)  be  defined  as  in  example  2.  Consider  the 
decision  rule  defined  in  eqs.  7  and  8.  Assume  that  ij  =  i2  =  3,  so  that  both  decisionmakers  have  the 
saxTie  probability  model.  Assume  further  that  there  exists  a  probabi?  »y  distribution  Q  on  {3,4}  such 
that 


P,(il=j.i2=k)  =  Pi(ii=j.i2-k)  =  Q(j)Q(k).  (36) 

That  is,  each  decisionmaker  believes  that  the  other  DNfs  decision  model  is  selected  independently 
from  a  known  statistical  population,  where  the  set  of  possible  models  v/as  {P^JP^}.  Furthermore,  the 
statistics  of  the  selection  are  known  identically  to  both  decisiomakers.  However,  the  precise  model 
selected  is  private  knowledge  provided  to  each  decisionmaker. 

Assume  Q(3)  =  .1.  As  in  example  2,  we  assume  that  the  DMs  alternate  exchanging  tentative 
decisions.  As  in  example  2,  DM  I's  first  decision  is  uj(l)  =  1,  and  this  signals  that  w  e  aj  to  DM  2. 
This  decision  does  not  reveal  whether  ij  =  3  or  i j  =  4,  because  »f  ij  =  3  or  4,  the  same  decision  would 
be  made. 

At  this  point  in  the  consensus  process,  DM  2  knows  that  we  a|  n  63  whether  i|  =  3  or  4.  Hence,  by 
eq.  8,  since  i2  =  3,  the  optimal  decision  is  U2(l)  *  0.  In  order  to  identify  the  information  signalled  by 
this  decision,  we  must  examine  the  optimal  decisions  corresponding  to  the  possible  information  sets 
that  DM  2  could  have,  from  DM  I's  perspective.  These  decisions  are: 

If  i2  =  3,  coe  aj  n  bj  ,  then  U2(l)  =  Ibyeq.  24  a 
If  i2  =  3,  (0  e  aj  o  62  j  then  U2(l)  =  1  by  cq.  24  b 

If  i2  =  3,  to  e  aj  n  b3  ,  then  U2(l)  =  0  by  eq.  24  c 

If  i2  =  4,  toe 3]  n  bj  ,  then  U2(l)  =  0  by  eq.  24  a 

If  i2  =  4,  to  e  n  b2  ,  then  U2(l)  =  1  by  eq.  24  b 

If  i2  =  4,  toe  aj  n  b3  ,  then  U2(l)  =  Ibyeq.  24  c. 

Hence,  DM  1  knows  that  either  i9  =  3,  to  e  a  j  n  b3  or  i2  =  4,  to  e  a^  n  bj.  According  to  eq.  7,  his 
optimal  decision  is  selected  as 
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ui(2)  =  argmin  E^l  {  J((0,  u)|F} 
ueU 


<*  i\ 


•‘v  r. 


=  argmin{  Q(3)  I(.25)(.53)I{u  =  0}  +  (.75)(.47)I{u=l }]  + 

0(4)  r(J)(33)  l{u=0)  +  (.5)(.47)I{u=l)]) 


where  I{  }  is  the  indicator  function,  and  a  constant  scaling  factor  has  omitted.  It  is  easy  to  see 

that,  for  0(3)  <  .12,  ui(2)  =  i.  Note  that,  if  ij  =  4,  eq.  37  becomes 


■v. 

I  •  •  f*  t 

'V-  Jv 


ui(2)  -  arg  min  {  0(4)  [(2/3)(.53)I{u  =  0}  +  (2)(.47)I{u=l  }1  + 

«e{0,l}  O(3)I(3)(.53)I{u=0}  +  (3)(.47)I(u=l}]}  (38) 

So,  u  j(2)  should  be  0  if  ii  “  4.  Therefore,  uj(2)  <=  1  signals  that  i  j  »  3  to  DM  2. 


V '  sr 

>j.  I 


DM  2  now  knows  ij «» i2  =  3,  and  m  e  al  n  b3.  As  before,  his  (^dmal  decision  is  U2(2)  =  0.  This 
decision  does  not  convey  any  additional  information  to  DM  1 ,  because  the  decision  U2{2)  =  0  did  not 
depend  on  ihe  information  il  =  3  Since  DM  1  obtains  no  additional  information,  his  tentative  decision 
continues  tc  be  ul(2)  =  1,  and  the  two  decisionmakers  agree  that  an  agreement  cannot  be  reached.  The 
common  information  which  forms  the  basis  for  this  disagreement  car  be  sununarized  in  the  atom 


{(m,il,i2)  I  (0)  e  aj  n  b3  ,  ij  =  i2  =  3),  or  ^to  e  aj  o  bl,  ij  =  3,  i2  =  4)}. 


Consider  now  the  same  problem,  but  assume  that  DM  2  communicates  first  Then,  since  ©  e  bj  and 
P^(X  I  b3 )  =  J5,  then  U2(l)  =  l.DM  1  observes  al  and  receives  U2(l)  =  1;  by  the  same  argument  as 
above, he  concludes  that  DM  2  has  observed  b^  and  uses  either  model  3  or  4.  His  decision  is  u  j  ( 1 )  =0. 


'  -J  ^  A 

C: 


DM  2  is  aware  that  DM  i  knows  both  aj  and  b3,  so  when  he  receives  uj(l)  =  0,  he  interprets  this  to 
mean  that  ij  =  3  and  w  e  a|  and  b3.  Hence,  he  communicates  U2(2)  =  0.  This  reveals  that  i2  =  3, 
and  the  decisionmakers  reach  a  consensus. 


■y 


:• 


The  results  of  example  3  are  rather  surprising.  Unlike  the  cases  studied  in  [6]  or  [7],  there  is 
no  unmodeled  secret  information  present  in  this  consensus  process.  Indeed,  both  DNTs  are  actually 
using  the  same  probability  model;  furthermore,  they  have  identical  probability  distributions  over  the 
class  of  probability  models,  and  this  is  common  knowledge!  Nevertheless,  a  disagreement  outcome 
occurs.  This  implies  that  even  admitting  the  possibility  diat  the  other  DM  can  have  a  different 
subjective  probability  nuxiel  than  yotir  own  is  sufficient  to  prevent  reaching  a  consensus.  The  reason 
for  this  effect  is  the  difference  in  the  probability  distributions  used  by  each  DM  in  eqs.  V  and  8  when 
one  DM  is  unable  to  identify  the  probability  model  used  Ity  the  other  DM. 

Ifow  likely  is  it  that  the  conditions  of  propositkm  5  are  met?  Oir  analysis  of  the  previous 
section  can  be  extended  to  establish  die  following  fnoposidons: 

Proposition  6.  Suppose  that  '.he  decision  space  U  was  continuous,  and  that  assumption  A3  was  true. 
Assume  i^  =  i2-  Assume  additionally  that,  for  all  B  e  £ 

L  uj((U)»  argmin  EQl{J(ti),u)|B}  eU® 
ueU 


il  U2(m)=  argmin  eQ2{J(0),u)|  B}  eU° 
ueU 

where  U®  is  the  interior  of  U.  Then,  the  outcome  that  the  consensus  process  will  reveal  that  ij  >=  i2  is 
generic  in  n^,  where  k  is  the  cardinality  of  L 

Proposition  7:  If  the  decision  space  U  is  discrete,  i^  » i2,  and  for  any  A  e  £,  there  exists  unique  U|, 
U2  in  U  such  that 

U]  =  argmin  E^l  {J(oj,u)  [  A}  , 
u  E  U 

U2  =  arg  min  eQ?  {J(o),u)  I  A) 
ueU 

then  the  outcome  that  a  consensus  process  reveals  that  i]  =  12  for  a  specific  (O  is  continuous  in 


The  proof  of  these  propositions  follows  directly  the  proof  of  propositions  1  and  ?,  and  will  not 
be  reproduced  here.  Essentially,  proposition  6  is  based  on  the  fact  that  the  set  of  probability  models 
for  which  a  continuous  decision  fails  to  discrimirtate  among  a  finite  set  of  models  has  zero  Lebesgue 
measure  in  the  space  of  all  possible  probability  models.  Under  the  assumptions  of  proposition  7,  one 
can  show  that  the  sequence  of  o-fields  generated  in  the  consensus  process  does  not  change  with  small 
perturbations  in  the  set  of  individual  probability  models. 

When  ij  #  i2,  it  is  possvble  to  show,  by  arguments  similar  to  those  leading  to  eqs.  27*32  that 
the  sequence  of  decisions  {djCC  i(n))}  and  {d2(G  2(0)))  will  converge  to  dj(G  j)  and  d2(C»  2^ 
respectively.  However,  since  the  probabilistic  iiKxlels  of  1  and  DM  2  are  not  die  same,  whether 
or  not  a  consensus  is  reached  depends  on  the  event  me  D  and  the  order  of  eonununkation.  If  g  is  an 
atom  of  G  ]  A  G  2  containing  <0  e  D,  dien  a  consensus  will  be  reached  if  d|(g)  =  d2(g). 

The  above  results  were  based  on  the  assumption  that  the  underlying  statistical  models  Pj  and 
P2  used  by  each  DM  are  common  knowledge.  When  these  mockls  different,  and  this  fact  is  secret 
knowledge,  and  the  decision  processes  of  DM  1  and  2  are  consistent  with  their  own  beliefs,  dien 

the  consensus  i»x)cess  reaches  one  of  three  different  outcomes  after  a  finite  number 
communications: 

1.  A  consensus  is  reached, 

2.  DM  1  and  DM  2  realize  that  their  underlying  statistical  models  are  inconsistent, 

3.  DM  1  and  DM  2  agree  to  disagree  because  they  cannot  gather  any  additional  information 
from  the  consensus  process. 

In  OTder  to  establish  this,  we  must  describe  the  evolution  of  the  decision  processes  according  to 
each  DM's  subjective  decision  model,  and  determine  what  each  DM's  model  predicts.  Then,  we 
compare  the  pircdicted  communications  with  the  actual  communications  heard.  Let  U]  |(n),  U|2(n) 
denote  the  decisions  of  DM  1  and  DM  2  at  stage  n  according  to  DM  I’s  subjective  decision  model. 
Similarly,  let  U2](n),  U22(n)  denote  the  decisions  of  DM  1  and  DM  2  at  stage  n  according  to  DM  2's 
subjective  decision  model.  Then,  according  to  DM  I’s  view. 


(39  b) 


where  djj  denotes  the  decisions  formed  by  the  decision  rule  d  according  to  the  probability  measure 
Vrii.  Similarly,  according  to  DM  2's  view. 


U2i(n)  -  d2i(y^U|2(l),...,ui2(n-l)) 


(40  a) 


U22(n)  ■  d22(y  »“ll(^)»-"»“ll(“-l)). 


(40  b) 
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where  d2j  denotes  die  drcisions  formed  by  die  decision  rule  d  according  to  the  probability  measure 
Vn2*  Fquations  39  and  40  describe  the  evolution  of  the  consensus  process  according  to  I's  and 

DM  2*s  perception,  respectively. 

1  o  determine  what  1  and  DM  2  predict  about  the  outcome  of  the  decision  processes  in 

tenns  <rf  dieir  own  perceptions,  we  define  four  sequences  of  information  c-fields,  representing  DM 
I's  actual  knowledge  (G  j  i(n)),  DM  I's  belief  of  DM  2's  knowledge  (G  i2(n)),  I»!  2's  actual 
knowledge  (G  22(n))  and  DM  2's  belief  of  DM  I's  knowledge  (G  2i(n)).  These  fields  evolve  under 
communications  as: 


i  ? 

J:  ■' 

^  *.* 

it  '.v 

I- 

& 

V  , 

I’ 


G  I  i(n+l) «  G  I i(n)  V  o(di2(G  i2(n))) 
G  j2(n+l)  =  G  |2(n)  V  o(du(G  ||(n))) 
G  2i(n+l)  =  G  2i(n)  V  o(d22((7  22("W) 
G  22(n+l)  -  G  22(n)  V  o(d2i(G  2i(n))). 

with  initial  conditions 

Gii(O)  =  G2i(0)=  fl  X  Hj 

G22(0)  =  G  j2(0)=  f  2  X  H2. 


(41a) 
(41b) 
(41c) 
(41  d) 


(41c) 


(41  f) 


As  before,  thc»-  evolutions  occur  in  a  lattice  of  o-fields  where  the  maximal  cldnent  is  a  finite  o-field. 
Hence,  repeating  the  logic  of  the  proof  of  proposition  4  establishes  that  the  consensus  process  will 
reach  steady-state  a  i  cr  a  finite  number  of  iterations.  Denote  this  finite  number  as  T. 


r,  *vjii  <•.  "V  V,  -r.  . 


To  esViblish  the  ^pe  of  outcomes  possible,  we  must  examine  the  consensus  process  closely. 

At  st’tgc  1,  DM  I's  decision  is  uu(l) «  dj  j(y  |).  This  message  is  transmitted  to  DM  2,  who  must 
interpret  this  message  according  to  his  own  subjective  decision  model.  That  is,  he  must  find 
realizations  ( i',  y  )  of  possible  models  and  observation  values  such  that  u  j  jf!)  =  d2i(y  *').  For  a 
consistent  interpretation,  one  must  have 

Pn>b{(y*'|uii(l)  =  d2i(yb.ii  =  i’}  >  0.  (42) 

If  diis  is  not  possible,  2  will  discover  that  the  decision  models  are  inconsistent,  leading  to  outcome 
2.  Otherwise,  DM  2  selects  U22(1)“  d22(y^»“ll(l))- 

At  tl.is  stage,  DM  1  must  interpret  consiste  ttly  the  communications  heard  from  DM  2.  As 
before,  he  must  find  realizations  ( j',  y^  ')  of  possible  models  and  observation  values  such  that  U22(l) 
“  di2(y^’»“l  ^  consistent  interpretation,  one  must  have 

Prt>b{  ( y2*  I  U22(l)  -  di2(y^*.ui  l(l)).  *2  -  J"  >  >  0-  (43) 

Define  (n,  i),  P^(n  j)  as  follows: 

P^n4)  »  ?•{  ( y !«! lO)  *d2i(  y*’,U22(l).-  •  •»  0220))*  *1  “  V 

for  all  j  <  n  }  (44  a) 

P2(n,j)  »  P'{  ( y2lu22(j)  =di2(  id)*-  •  -  ui*(j)),  i2  =  j, 

for  all  j  <  n  }  (44  b) 

It  is  easy  to  see  that,  for  each  i  e  I ,  P^(n4)  and  P^n4)  arc  monotone  decreasing  sequences  in  n. 

Since  the  consensus  process  reaches  steady-state  after  a  finite  number  of  communications  (for  n  ^  T), 
there  arc  three  possible  outcomes: 

1.  There  exist  no  i  or  J  such  that  both  (45) 

p‘(T4)Pi(i,i2)  >0 
p2(Tj)p2(i,,j)  >0 


(46) 


2.  There  exist  one  i  and  one  j  such  that 

pl(T4)Pi(i.i2)  >0 
p2(To)P2(ii.j)  >0 

3.  There  exist  more  than  one  iorjsnch  that  (47) 

p‘(T,i)Pi(M2)  >0 
p2(To)P2(iij)  >0 

When  eq.  4S  bolds,  the  inconsistencies  amoog  the  statistical  models  Pj  and  P2  are  detected  in 
the  consensus  process.  It  is  possible  that  the  true  probability  model  used  by  1  was  not  considered 
possible  in  DM  2's  subjective  distribution.  In  this  case,  for  some  n,  either  d«2(T)  ift  d22(T)or 
d2i(T)  ^djjCn. 

When  equation  46  holds,  either  a  consensus  outcome  will  be  reached  for  oo  e  Q,  or  an 
inconsistency  in  the  undeiiying  probability  models  P'  and  PJ  will  be  discovered.  If  i  -  j,  this  is  the 
case  analyzed  in  [IH^l;  in  this  case,  the  results  of  [5]  guarantee  that  dj  j(T) »  d}2(T)  «  d2i(T)  * 
d22(T),  so  a  consensus  outcome  is  reached.  If  i  #  j,  this  is  the  case  analyzed  in  [6]  and  [7].  In  this 
case,  two  types  of  outcomes  arc  possible:  cither  d22(T)  *  ^22^  d2i(T)  1*  dj  j(T),  so  that  an 
inconsistency  in  models  P*  and  pJ  is  detected,  or  both  d|  |(T)  =»  d22(T)  and  d22(T) «  d«2(T).  For  the 
second  outcome,  the  results  in  [6]  and  [T\  imply  also  that  d|  ](T)  >  d22(T). 

When  eq.  47  holds,  there  is  residual  ambiguity  in  both  the  statua  :al  models  and  die  undenying 
probability  models.  As  discussed  in  example  3,  it  is  possible  to  have  d]2(T)  d22(T),  d2i(T)  => 

d}  |(T),  and  dj  2(T)  ^22^  ^  addition  to  the  other  two  outcomes.  In  this  case,  no 

inccmsistencies  have  been  discovered  between  either  the  statistical  models  P|  and  P2,  or  the 
underlying  probability  models  P  j  and  P‘2-  Rather,  the  decisionmakers  hive  reached  a  stage  where  no 
additional  information  will  be  exchanged  in  the  consensus  process.  At  this  stage,  the  DMs  agree  that  c 
consensus  cannot  be  reached,  and  discontinue  the  process. 

The  above  discussion  has  established  the  following  proposition: 

Proposition  8:  Assume  A6  is  satisfied,  and  that  the  differences  between  P|  and  P2  are  secret 
knowledge  to  both  decisionmakers.  Then,  the  consensus  process  will  reach  one  of  three  possible 
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outcomes  after  a  finite  number  of  communications; 

1.  A  consensus  is  reached, 

2.  DM  1  and  DM  2  realize  that  their  underlying  statistical  models  are  inconsistent, 

3.  DM  1  arxl  DM  2  agree  to  disagree  because  they  cannot  gather  any  additional  infonnation 
from  the  consensus  {Kocess. 

A  result  similar  to  Proposition  8  was  obtained  in  [7].  However,  when  each  DM  considers  a  set 
possible  underiying  probability  models  for  the  other  as  is  the  case  in  this  section,  the 
consensus  process  can  result  in  an  outcome  not  predicted  by  the  model  of  [7];  namely,  the  can 
agree  to  disagree  even  though  their  underlying  probability  models  have  not  been  established  as 
inconsistent  widi  each  oth^.  This  point  is  illustrated  by  the  following  example. 

Example  4:  Let  ),  K  y'2>  ^  be  defined  as  in  examples  2  and  3.  Let  J((0,u)  be 

defined  as  in  eq.  13,  and  let  U  =  [0,1].  Let  I  =  { 13,4}  be  the  set  of  possible  probability  model 
indices.  Consider  the  decision  rule  defined  in  eqs.  7  and  8.  Assume  that  i  j  «  1,  and  Pj(l,l)  « .3, 
P}(13)  *  0,  P  j(l,4) « .3,  so  that  1  believes  2  is  using  either  model  1  or  nuxtel  4  with  equal 

probability.  Assume  12  «  4,  and  P2(l,4)  «  0,  P2(3,4)  -  .1,  P2(4,4)  » .1,  so  that  DM  2  assumes  that 
DM  1  is  using  either  model  3  or  model  4  with  equal  probability.  Assu^ne  that  me  a ^  n  bg  and  DM  1 
communicates  furst 

The  first  tentative  decision  of  DM  1  isu{}(l)s:3.  According  to  1^2,  if  DM  1  was  using  model  P^, 
thenu2i(l)».3  when(aea],and.25whenci)ea2.  IfDM  1  was  using  model  P^,  then  U2j(l)# 

.25  for  any  <0.  Hence,  DM  2  believes  i|  =  3,  and  me  a j  n  bg.  According  to  his  own  model,  P^, 

DM  2’s  communication  is  U22(l)  3. 

In  communicating  Uj  |(1)  =  J,  DM  1  believes  that  he  has  signaled  that  ij  =  1  and  m  e  aj .  Hence,  DM 
1  expects  Uj2(l)  =  .5  if  i2  =  1  and  m  e  b|,  or  if  i2  =  4  and  m  e  b3.  Consequently,  DM  1  chot^es 

uji(2)  =  pJ(X|ainb,)  P‘(ainbi)Pi(l,l)  +  pJ(XIajf^b3>P‘(ainb3)Pi(i,4) 

P‘(ainbi)Pi(l,l)  +  P*(ainb3)Pi(l,4)} 

=  .375 

Since  models  and  have  the  same  distribution  conditioned  on  a| ,  it  is  also  true  that  U2|(2)  = 


.375.  Since  DM  2  believes  he  already  knows  i|  ==  3,  and  o)  e  a|  rt  1)3,  he  learns  no  additional 
information,  so  his  decision  continues  to  be  U22(2)  -5.  This  decision  conveys  no  additional 
information  to  DM  1 ,  so  the  consensus  ]:Mt>cess  stalls  at  this  point,  and  both  decisionmakers  agree  that 
a  consensus  cannot  be  reached. 


5.  CONCLUSION 

In  this  paper,  we  have  studied  the  problem  of  reaching  a  consensus  in  a  grouped 
decisionmakers  by  exchanging  tentative  decisions  using  a  Bayesian  framewoik.  When  the 
decisionmakers  have  different  pn>bability  models  and  the  existence  of  those  diffoences  is  secret 
knowledge,  the  results  of  Teneketzis  and  Varaiya  [6],  [7]  characterized  all  possible  outcomes  of  the 
consensus  process  into  two  types  of  outcomes: 

1.  Reaching  a  consensus  decision  for  the  group, 

2.  Reaching  a  contradictior:. 

The  results  of  section  2  shed  additional  insight  concerning  how  likely  each  of  these  outcomes  is.  By 
defining  the  concepts  of  a  generic  outccmie  and  a  continuous  outcome,  we  have  shown  that,  when  the 
decision  space  is  continuous-valued  'jod  stnne  regularly  conditions  are  met,  reaching  a  contradiction  is 
a  generic  outcome.  In  contrast,  wnen  the  decision  space  is  discrete-valued,  both  outcomes  are 
continuous,  so  that  small  deviadons  in  probability  models  result  in  the  same  outcomes. 

One  of  the  limitadons  of  the  results  of  Teneketzis  and  Varaiya  is  their  assumpdon  that 
knowledge  that  probability  models  could  be  different  is  secret  knowledge  to  the  decisionmakers, 
although  in  fact  the  probability  models  are  different  If  the  decisionmakers  are  humans,  subject  to 
various  biases  and  inaccuracies  in  evaluating  probabilities  [9],  knowledge  that  there  can  exist 
differences  in  probability  models  is  best  modcic*!  as  common  knowledge.  In  section  4,  we  developed 
a  Bayesian  framework  whereby  this  knowledge  is  represented  as  common  knowledge,  and  the  specific 
individual  probability  models  are  represented  as  private  information  for  each  decisionmaker.  In  this 
framework,  the  consensus  process  serves  both  to  reveal  information  concerning  the  probability  model 
of  each  decisionmaker,  as  well  as  information  concerning  the  problem  uncertainty.  A  surprising  result 
is  that,  even  when  the  probability  models  of  the  decisionmakers  are  identical,  and  selected 
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independently  from  identical  probability  distributions,  th^r  -nc  two  possible  outcomes: 

1.  A  consensus  was  reached. 

2.  A  point  was  reached  where  both  decisionmakers,  on  the  basis  of  common  information, 
agree  that  a  consensus  cannot  be  reached. 

The  second  outcome  has  not  been  predicted  by  the  previous  formulations  [1]  •  [7].  Indeed,  it  seems 
to  contradict  die  title  of  Geanakoplos  and  Polemarchakis  [4],  ”y/e  can't  disagree  forever.”  Our  results 
in  section  4  show  that  merely  admitting  the  possibili^  that  the  probability  models  are  different  is 
suffic^t  to  generate  the  second  outcome.  Again,  we  characterize  how  likely  these  outcomes  are  for 
both  continuous-valued  and  discrete-valued  decision  spaces  U. 

In  conclusion,  we  have  shown  that,  in  our  Bayesian  framework,  when  the  decisionmakers 
bring  human  biases  and  inaccuracies  in  probability  assessments  into  the  consensus  process,  a 
consensus  may  not  be  reac'  ^  even  if  the  decisionmakers  share  the  same  probability  model  The 
results  depend  explicidy  on  the  Bayesian  formulation  for  incorporating  uncertainty  cooceming  the 
other  decisionmaker’s  true  probability  model.  A  different  formulation,  sin^ar  to  Kreps  and  Wilson's 
formulation  for  sequential  games  [11],  could  be  developed  whereby  each  decisionmaker  considers 
only  the  most  likely  interpretation  of  the  results  as  the  basis  for  selecting  his  tentative  decisions.  The 
merits  of  each  formulation  rest  ultimately  in  their  ability  to  help  us  understand  the  behavior  of  humans 
in  consencus  decisionmaking. 


Fig.  1  b.  Event  X  in  the  example 
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ABSTOACT 


Subjective  games  of  incomplete  information  are  formulated  where  some  of  the  key  assumptions 
of  Bayesian  games  of  incomplete  information  arc  relaxed.  The  issues  arising  because  of  the  new 
formulation  are  studied  in  the  context  of  a  class  of  non-zero-sum  two-person  games,  where  each 
player  has  a  different  model  of  the  game.  Two  types  of  games  are  investigated:  static  games  and 
infinitely-repeated  games.  It  is  shown  that  in  the  static  game  counterintuitive  outcomes  of  the  game 
occur  because  of  the  different  beliefs  of  the  players,  and  that  these  outcomes  indicate  to  the  players 
that  their  models  were  different  When  the  game  is  repeated  infinitely  often,  it  is  shewn  that  this 
repetition  can  alleviate  the  differences  in  the  models  of  the  players  and  lead  to  eventual  cooperation. 
When  multiple  equilibrium  solutions  are  present,  the  effect  of  various  bargaining  theories  on  the 
outcome  of  the  game  is  investigated.  It  is  shown  that,  depending  on  the  bargaining  model,  the 
players  may  agree  on  the  outcome  of  the  game  or  they  may  realize  that  they  have  different  models. 


SECTION  1:  INTRODUCTION 


Game  theory  is  the  mathematical  science  which  shidies  decisionmaking  in  situations  of 
potential  conflict  among  decisionmakers.  The  requirements  of  formal  game  theory  are  strict 
regarding  the  rule'  of  the  game  and  the  portrayal  of  exogenous  uncertainty.  Due  to  these 
requirements,  there  are  many  strategk  situations  which  cannot  be  initially  modeled  as  games 
because  players  lack  information  about  available  strategies,  utility  functions  or  outcomes  resulting 
from  various  strategies. 

Speciflcally,  the  key  requirements  of  formal  game  theory  are: 

Al.  The  rules  of  die  game  are  common  information  [15]  to  all  players  of  the  game 

A2.  Exogenous  uncertainty  is  portrayed  by  objective  probabilities  which  are  common 
knowledge  to  all  players. 

A3.  Players  are  fuUv  committed  to  a  priori  strategies 

A4.  Players  are  rational. 

As  Game  theory  developed,  attempts  were  made  to  relax  some  of  these  assumptions. 
Assumptic  n  A3  was  a  consequence  of  the  normalization  principle  of  Von  Neumann  [12];  Aumann 
and  Maschler  [ !]  were  the  first  to  point  out  via  a  simple  counterexample  the  inai^xropriateness  of 
the  iKirmalization  principle  under  certain  conditions;  since  then,  considerable  developments 
followed  by  relaxing  the  assumption  of  priw  conunitment  [2]-[6]. 

Harsanyi[7]  arxl  Aumann-Maschler  et  al.  [8]  pointed  out  that  in  some  military  juxiblems, 
idayers  may  lack  full  information  about  the  payoff  functions  of  other  flayers,  or  about  the  physical 
facilities  and  stiategies  of  other  players,  or  even  about  the  amount  of  information  that  other  players 
have  about  the  various  aspects  of  the  game  situation.  Thus,  Harsanyi  [7]  first  relaxed  assumption 
A I  and  formulated  and  developed  models  of  games  of  incomplete  infomiation.  Harsanyi  modeled 
the  incomplete  information  as  an  exogenous  random  move  (Nature’s  move)  to  select  among 
possible  games;  he  also  assumed  that  the  outcomes  of  this  move  have  a  (subjective)  probability 
distribution  which  is  common  knowledge  to  all  players.  Considerable  progress  has  been  achieved 
in  the  theory  of  games  of  iiKomplete  information  using  Harsanyi’s  original  formulation  (see 
(81-[  10)  and  references  therein.) 


A  restriction  in  Harsanyi's  formulation  is  the  requirement  of  common  knowledge  of  the 
probability  distribution  of  Nature's  move.  In  many  strategic  situations  (especially  in 
noncooperadve  games),  this  distribution  is  subjectively  assessed  by  each  player,  and  subject  to 
individ'iJ  biases  and  inaccuracies  [27].  In  this  paper,  we  formulate  a  class  of  games,  which  we 
call  Subjective  Games  of  Incomplete  Infomaticn,  which  relaxes  Harsanyi's  requirements  of 
common  knowledge.  Specifically,  we  allow  each  person  to  have  his  own  subjective  probability 
distribution  of  nature's  move;  in  addition,  each  person  believes  his  subjective  distribution  is 
common  information,  whereas  it  is  actually  secret  information  [IS].  As  a  consequence, 
requirements  A1  and  A2  are  relaxed,  and  requirement  A4  is  modified  in  die  sense  that  each  fdayer  is 
considered  to  be  rational  within  his/her  own  subjective  view  of  the  game. 

Various  interesting  issues  arise  because  rtf’ our  formulation; 

QI.  How  are  equilibrium  strategies  defined  for  subjective  games? 

Q2.  How  do  these  equilibrium  strategies  relate  to  the  equilibrium  strategies  of  the  games 
studied  so  far? 

Q3.  Does  repetition  of  the  game  result  in  cooperation  as  in  the  case  of  the  games  studied  so 
far  (e.g.  [11])?  Does  repetition  of  the  game  alleviate  differences  in  die  subjeedve 
assessments  of  the  players  and  allow  players  to  agree  or  an  equilibrium  strategy? 

Q4.  Is  it  possible  to  characterize  the  set  of  all  equilibria  for  repeated  subjective  games? 

To  understand  some  of  these  questions,  we  shall  consider  a  special  class  of  games,  namely 
2x2  two-person  non-zero  sum  games  of  incomplete  information  where  the  payoff  matrices  have  a  . 
special  structure. 

The  rest  of  this  paper  is  organized  as  follows:  In  section  2,  we  present  die  model  for 
subjective  games,  and  briefly  discuss  games  of  incomplete  information  and  point  out  the 
differences  between  Harsanyi's  model  and  our  model.  In  section  3,  we  study  static  subjective 
non-cooperative  games  of  incomplete  information.  In  section  4,  we  consider  infinitely  repeated 
non-cooperative  subjective  games  of  incomplete  information.  Conclusions  are  presented  in  section 
5. 
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SECTION  2:  FORMULATION  OF  SUBJECTIVE  GAMES  OF  INCOMPLETE  INFORMATION 


We  shall  develop  our  theory  of  subjective  games  based  on  the  following  key  assumptions; 

51.  Players  have  different  probability  assessments  on  nature's  move. 

52.  Each  player  thinks  that  the  other  players'  a  >sessments  are  the  saine  as  his. 

53.  Players  are  Bayesian. 

54.  Each  player  is  rational  within  his  tiwn  subjective  view  of  the  game. 

Assun^)don  S2  imf^ies  that  die  rules  of  the  game  are  not  common  knowledge  to  all  the 
players,  since  each  frfayer  thinks  that  the  other  pl^^ers'  assessments  are  the  same  as  his,  yet  this 
may  not  be  true.  Assumptions  Si  and  S2  were  previously  used  in  the  context  of  distributed 
estimation  and  detection  [14]. 

More  precisely,  let  b|[  represent  the  private  information  of  i^ayer  i  about  the  game.  This 
information  relates  to  the  outcome  of  nature's  move.  In  d^ing  widi  incomplete  information,  each 
player  takes  a  Bayesian  approach.  That  is,  each  player  assigns  a  subjective  probability  distribution 

Pi 

Pi  »Pj  (b|,b2,...,bj,...bn) 

to  nature's  move  and  attempts  to  nuximize  the  mathematical  expectation  of  his  own  payoff  Jj  in 
terms  this  probability  distribution.  Furthermore,  each  player  i  assumes  that  Pj  =Pj  for  all  j, 
whereas  in  the  actual  game,  Pj  and  Pj  may  be  different 

Comparing  the  mathematical  model  described  above  with  Harsanyi's  formulation,  we  note 
tlu.i  Harsanyi  also  assumes  that  each  player  assigns  a  subjective  probability  distribution  Pj  to 
;  ^'s  move;  although  Pj  and  Pj  may  differ,  all  the  distributions  Pj  are  assumed  to  be  common 
».  jwledgc  to  all  players.  In  our  formulation,  any  difference  in  subjective  probabilities  is  secret 
infer, i^aticm.  Moreover,  each  player  is  unaware  that  he  has  secret  information. 
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SECTION  3: 

STATIC  SUBJECTIVE  NON-COOPERATIVE  GAMES  OF  INCOMPLETE  INFORMATION 


3.1  Rroblem  formulation 


We  consider  the  following  static  two-person  non-zero  sum  game.  Nature  selects  one  of  two 
games  with  the  following  payoff  matrices; 


Game  1: 


X  M 


It  (bfi) 


Game  2: 


X  (b,b) 


(a,d) 


We  further  assume  that 


a  >  c  >  b  >d 


b+c  >  a  +  d 


Player  1  can  choose  action  X  or  ^  and  player  2  can  choose  action  o  or  T.  Note  that,  because  of 


(3-3),  each  player  has  a  dominant  strategy  in  each  one  of  the  two  games.  So  far,  the  statement  of 
the  pi  blem  and  the  assumptions  (3*3)-(3  4}  are  essentially  the  same  as  in  |15].  However, 
contrary  to  [ IS],  we  now  assume  that  the  two  players  have  a  different 
probability  assessment  of  nature's  move.  Let  r  be  the  true  probability  that  nature  selects  game  1. 
Let  p,  q  be  player  I's  and  player  2's  assessments  of  this  event  respectively.  Assume  p>l/2,  q<l/2. 


We  will  consider  this  problem  under  four  different  types  of  information  that  a  player  may 
receive; 

1.  No  information:  In  this  case,  none  of  the  players  is  informed  about  the  outcoihe  of  nature's 
move. 

2.  Public  information:  In  this  case,  both  players  are  informed  about  the  outcome  of  nature's 
move. 

3.  Private  information:  In  this  case,  one  player  is  informed  about  the  outcome  of  nature's 
move,  whereas  the  other  player  is  not.  Moreover,  this  distribution  of  information  is 
common  knowledge. 

4.  Secret  information:  In  this  case,  one  player  is  informed  ibout  the  outcome  of  nature's 
move  whereas  the  other  player  is  uninformed.  Moreover,  the  uninformed  player  is 
unaware  that  his  opponent  is  informed,  and  the  informed  player  knows  this. 


The  rational  strategies  in  each  of  these  situations  are: 


1.  No  Information 

In  this  case,  player  1  plays  X  and  player  2  plays  t.  The  payoffs  of  the  two  players  are; 
jOj=rc  +  (l-r)d,  (3-5) 


and 


J®2= 


(3-6) 


2.  Public  information 

In  this  case,  player  1  plays  X  in  Game  1  and  |i  in  game  2.  Player  2  plays  a  in  game  1  and  t  in 
game  2.  Thus,  the  payoff  of  the  players  is 


“ra  +  (l-r)c. 


(3-7) 


Define  the  value  of  information  as  follows:  Vj ,  the  value  of  information  to  player  i,  is  the  payoff 
of  player  i  when  he  knows  the  outcome  of  nature’s  move  minus  the  payoff  of  player  i  when  no 
player  is  informed  about  the  outcome  of  nature's  move. 

In  this  case,  the  value  of  public  information  for  players  1  and  2  is  given  by 


V®1  -r(a-c)  +  (l-r)(c-d) 

(3-8) 

V®2  ■(2r-l)a+(l-r)c-rb 

(3-9) 

3.  Private  Information 

■ 

3a.  Assume  at  first  that  player  1  is  the  informed  player.  Then  he  plays  a  in  game  1  and  ^  in 
g2une2.  Player  2  plays  X.  The  payoffs  of  the  two  players  are 

jPl  =c 

(3-10) 

JP2  =  rb  +  (1-r)  c 

(3-11) 

3b.  If  player  2  is  the  informed  player,  then  he  plays  u  in  game  1  and  T  in  game  2.  If 

p  <  (a-d)/(a-K:-b-d)  =  p* 

(3-12) 

then  player  1  will  play  p.  Otherwise,  he  will  play  X.  The  expected  payoffs  for  player  1  are  then 

=  rb  +  (l-r)c 

(3-13) 

J^2  =  *^  +  (l-f)d 

(3-14) 

resjjectively. 
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•  -jt  -  v  -f  -r.  r. 


The  payoff  for  player  2  is 


1^2=0  (corresponding  to  p)  (3-15) 

(corresponding  to  X)  (3-16) 

The  value  of  infcrmadon  for  the  two  players  is: 

yPi-d-rXc-d)  (3-17) 

V^2  *  c*  ^  if  p<p* 

(3-18) 

?!  r(a-b)  other^rise 
4.  Secret  Information 


Assume  at  first  that  player  1  is  secretly  informed  about  the  outcome  of  die  chance  move. 
Then,  he  plays  X  in  game  1  and  p  in  game  2.  Player  2  plays  t.  The  payoffs  of  the  two  players  are 

-  c  (3-19) 

J^2  =  *  +  (3-20) 

The  value  of  secret  information  to  player  1  in  this  case  is 

vSj=(l.r)(c-d)  (3-21) 

Assume  now  that  player  2  is  secretly  informed.  Then,  he  plays  a  in  game  1  and  x  in  game  2. 
Player  1  plays  X.  The  payoffs  of  the  two  players  are 

=ra  +  (l-r)d  (3-22) 


(3-23) 
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(3-24) 


The  value  of  secret  information  to  player  2  in  this  case  is 
vSj^rfa-b) 


Let  us  discuss  scnne  interesting  features  of  the  solutions  of  these  games.  At  first,  note  that 
each  payoff  bimatrix  is  symmetric,  hence  in  each  one  of  tit'j  two  games,the  players  aiC 
interchangeable.  Thus,  one  expects  that  for  the  classk^al  Bayesian  game,  in  die  case  public  or 
secret  information,  the  behavior  of  the  informed  and  the  uninformed  player  will  be  independent 
udm  is  the  infumed  and  who  is  the  uninformed  pls^er.  For  example,  in  the  case  of  private  or 
secret  informadon,  if  player  1  were  the  uninformed  player  and  played  X,  we  would  expect  that  if 
the  situation  were  reversed  and  player  2  became  the  uninformed  player  !<e  would  play  a.  Also,  in 
the  case  where  no  player  was  informed  about  the  outcome  of  the  chance  move,  the  dominant 
strategies  would  be  (X,o)  or  (p,x).  Consequently,  the  value  of  private,  secret  or  public  information 
would  be  the  same  for  both  players.  It  can  be  easily  checked  that  this  is  indeed  the  case  when 
p-q>r.  However,  this  behavior  is  not  observed  when  each  player  has  his  own  subjective  model  of 
the  game.  When  player  1  is  jnivately  informed  about  the  chance  move,  player  2  always  chooses  x 
(the  second  column);  on  the  other  hand,  if  player  2  is  privately  informed  about  the  outcome  of  the 
chance  move,  player  1  does  not  always  play  p  (the  second  row).  When  player  1  is  the  secretly 
informed  player,  player  2  always  plays  x  (second  column);  if  player  2  is  the  secretly  informed 
player,  player  1  always  plays  X  (first  row).  When  no  player  is  informed  about  the  outcome  of  the 
chance  move,  the  outcome  of  the  game  is  (X,x).  These  facts  indicate  that  the  value  of  private  and 
secret  information  is  now  different  fmr  each  player,  as  is  evident  from  the  analysis  above. 

Fix'  the  class  of  games  considered  in  this  section,  ihe  value  of  public,  private  and  secret 
information  differs  from  player  to  player,  whereas  in  the  classical  Bayesian  framework,  this  value 
does  not  depend  on  who  is  the  informed  and  who  is  the  uninfa'med  player.  This  phenomenon  is 
due  to  the  differences  in  the  initial  probability  assessments  of  the  incomplete  information. 

Another  inteiresting  observation  follows  from  the  previous  results.  Consider  the  case  where 
player  2  is  privately  informed,  p  <  p*,  and  r  =  1/2.  Then,  the  value  of  information  for  player  2  is 
given  by 
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vP2»c-Jb-Ja 

if  c<  3  a  +  .5  b,  the  value  of  private  information  for  player  2  is  negative!  On  the  other  hand,  the 
gain  for  player  1,  the  uninformed  player,  is  equal  to  .5(b-d)  which  is  positive.  Thus,  for  the  class 
of  symmetric  games  considered  in  this  paper,  we  have  a  case  where  the  value  of  private  information 
is  negative  for  the  informed  player  and  the  uninformed  player  benefits  from  the  situation!  This 
phenomenon  never  occurs  for  this  class  of  games  in  the  classical  Bayesian  framewoilc,  where  if  the 
value  oi  private  information  is  negative  for  the  infom^  player,  the  uninformed  player  cannot 
benefit  either  [15].  Even  man  surprising  in  this  case  is  the.  fact  that  the  informed  pl^er  wants  to 
use  his  private  information,  whereas  the  uninformed  player  wishes  that  the  infcmiied  player  acted  as 
if  he  were  not  inftxmed! 

The  reason  for  all  these  counterintuitive  results  and  the  differences  between  the  subjective 
game  results  and  the  classical  Bayesian  game  results  is  that  each  player  evaluates  the  game  as  well 
as  the  behavior  his  opponent  in  the  game  in  terms  of  his  own  model  and  acts  accordingly.  Such 
subjective  evaluations  lead  to  behavitK*  which  would  never  occur  in  the  classical  Bayesian 
formulation  as  evidenced  by  the  previous  aiudysis. 

One  issue  that  naturally  arises  in  these  games  is  the  following:  How  do  the  players  involved 
in  the  game  interpret  its  outcome?  Do  they  realize  that  they  have  different  models?  If  neither  player 
is  informed  about  the  outcome  of  the  chance  move  player  1  expects  that  player  2  will  use  strategy  a 
and  player  2  expects  that  player  1  will  use  strategy  p.  At  the  end  of  the  game,  each  player  finds  out 
that  the  outcome  is  the  opposite  of  what  he  expected.  Since  each  player  assumes  that  his  opponent 
is  rational,  both  players  conclude  that  they  have  different  models.  Similar  phenomena  occur  if  one 
of  the  players  is  either  secretly  or  privately  informed. 

In  the  case  of  secret  information,  the  secretly-informed  player  discovers  at  the  end  of  the  game 
that  his  opponent's  perception  of  the  game  is  different  from  his.  On  the  other  hand,  the  uninformed 
player  may  never  discover  that  his  opponent  has  a  different  perception  of  die  game,  or  he  may  not 
be  able  to  interfHet  his  opponent’s  move  in  terms  of  his  own  model  in  which  case  he  can  conclude 
that  either  his  opponent  has  a  different  model  of  the  game,  or  his  opponent  has  ser.itt  information. 

In  the  case  of  private  information,  the  uninformed  ph’yer  is  not  in  a  position  to  discover  at  the 
end  of  the  game  that  his  opponent  has  a  different  view  of  the  game.  The  informed  player  may  or 
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may  not  discover  at  the  end  of  the  game  that  he  and  his  (^ponent  have  inconsistent  beliefs  about  the 
game,  depending  on  whether  eq.  (3-12)  holds.  Note  that  if  both  p,q  >  1/2  or  p,q  <  1/2,  the  players 
never  discover  the  differences  in  their  models. 

In  this  section,  we  presented  and  analyzed  a  simple  class  of  two-person  non-cooperative 
nonzero  sum  one  stage  subjective  games  of  incomplete  information,  and  showed  how  the 
inconsistent  beliefs  of  the  players  lead  to  counterintuitive  behavior.  An  important  issue  which  has 
not  been  discussed  so  far  is  whether  the  differences  in  beliefs  between  the  two  players  are  amplified 
or  smoothed  out  if  the  game  is  repeated  over  aitd  over.  We  address  this  issue  in  the  next  section. 
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SECTION  4; 

INFINrrELY  REPEATED  SUBJECTIVE  GAMES  OF  INCOMPLETE  INFORMATION 


In  this  section,  we  consider  the  infinitely  repeated  version  of  the  class  of  games  studied  in 
section  3.  First,  we  study  the  situation  where  no  player  is  informed  concerning  the  outcome  of 
Nature's  move.  Then,  we  consider  the  case  of  private  information.  For  this  class  of  games,  we 
show  that  repetition  can  alleviate  differences  between  subjective  models  and  lead  to  agreement 
about  the  outcome  of  the  game.  In  addition,  we  show  that  tlw  value  of  private  information  is 
always  positive  in  this  infinitely  repeated  game.  Finally,  we  examine  the  effect  of  various 
bargaining  models  on  the  outcome  of  the  game. 

Before  we  proceed  with  the  analysis,  let's  define  the  meaning  of  a  solution  to  an 
infinitely-repeated  nonzero  sum  two-person  game.  According  to  the  results  of  [1 1],[16],[17],  the 
set  of  equilibrium  outcomes  of  the  infinitely  repeated  game  are  all  the  payoffs  which  are 
individually  rational  [16].  Among  these  equilibrium  outcomes,  the  set  of  efficient  equilibria  (also 
known  as  the  core  equilibria  of  the  game)  are  the  set  of  outcomes  which  are  also  Pareto  optimal.  A 
pair  of  equilibrium  strategies  in  the  infinitely-repeated  game  will  be  a  solution  if  and  only  if  it 
produces  payoffs  among  the  v'ore  equilibria  of  the  infinitely-repeated  game. 

4.1  No  Information 

Assume  that  neither  player  has  informadon  concerning  Nature's  move.  In  this  case,  the 
one-stage  subjective  expected  payoff  matrix  of  each  player  can  be  computed  as  in  the  previous 
section.  The  set  of  obtainable  payoffs  for  players  1  and  2  are  given  in  figures  4-1  and  4-2 
according  to  player  I's  perception,  and  figures  4-3  and  4-4  acceding  to  player  2’s  perception.  The 
i'ldividually  rational  outcomes  for  each  player  have  been  outlined  in  the  shaded  areas  of  the  figures. 
It  is  easy  to  see  that  the  set  of  equilibrium  outcomes  of  the  infinitely  repeated  game,  as  perceived  by 
the  two  players,  do  not  coincide.  Moreover,  the  set  of  core  equilibria  are  different  according  to  the 
perception  of  the  two  players.  As  far  as  player  1  is  concerned,  the  set  of  core  equilibria  contains 
only  one  possible  ou'xome,  (pa  +  (l-p)b,  pa  +  (l-p)b),  achieved  by  the  strategy  (X,a).  The  core  of 
the  game  for  player  2  consists  of  all  the  points  of  the  lines  A  B  and  BC. 

Let  S’  denote  the  set  of  equili">rium  strategy  pairs,  as  perceived  by  player  i,  i=  1 ,2.  If  either 


-  j 


11 


player  plays  a  strategy  not  in  S*,  then  the  other  player  will  detect  an  immediate  inconsistency 
between  the  player's  models  and  an  equilibrium  will  not  be  reached.  We  can  now  define  what  are 
equilibrium  strategies  in  this  class  of  games. 

Definition:  The  set  S  of  equilibrium  strategies  of  the  infinitely-  repeated  subjective  game  of 
incomplete  information  is  the  intersection  of  and  S^. 

An  immediate  result  follows  from  this  definition: 

Lemma  4.1  If  a  pair  of  strategies  e  S,  the  players  never  realize  that  they  have  ''afferent 
models. 

Eioofi 

For  any  in  S,  each  7*  [1 1]  consists  of  two  parts:  The  strategy  player  i  implements  as  long 
as  the  other  player  does  not  deviate  from  his  announced  strategy,  and  the  threat  player  i  implements 
if  the  other  player  deviates  frum  his  announced  strategy.  We  must  show  that,  whether  deviations 
fipom  the  announced  strategy  occur  or  not,  no  player  finds  out  that  they  have  different  models. 

Assume  that  both  players  follow  their  announced  equilibrium  strategies.  Then,  since  e.\  S, 
the  rciulting  payoffs  are  individually  rational  according  to  each  player’s  perception,  so  the  players 
canrot  detect  that  they  have  different  models. 

Assume  that  player  1  (2)  deviates  from  his  announced  equilibrium  strategy  and  player  2(1)  detects 
this  deviation.  Player  I’s  (2's)  threat  consists  of  selecting  the  strategy  which  reduces  player  2’s 
(I’s)  payoff  the  most;  this  strategy  is  p  (x)  From  player  2’s  (I's)  perspective,  player  I’s  (2’s) 
threat  strategy  is  also  p  (X).  Hence,  when  a  deviation  occurs  the  players  cannot  detect  that  they 
have  different  models,  because  their  opponent's  threat  strategies  are  the  same  as  those  predicted  by 
their  own  models,  q.e.d. 


4.2  Private  information 


Consider  the  infinite  repetition  of  the  game  described  in  section  3,  and  assume  that  player  1  is 
privately  informed  of  Nature’s  move.  The  main  feature  of  this  game  is  that  the  uninformed  player 
can  collect  additional  informalio  i  throughout  the  play  of  the  game  by  watching  the  behavior  of  the 
informed  player.  On  the  other  iiand,  the  informed  player  has  to  decide  what  information  (if  any)  he 
has  to  reveal  to  his  opponent  and  at  what  rate.  Various  interesting  questions  can  be  asked  about 
this  game; 

1.  Can  revelation  of  private  information  be  beneficial  for  the  informed  player? 

2.  What  is  the  value  of  private  information? 

3.  Does  repetition  alleviate  the  differences  between  the  models  of  the  two  players? 

4.  Can  the  uninformed  player  take  advantage  of  the  private  information  of  the  informed 
player? 

To  find  the  answer  to  these  questions,  we  shall  determine  core  equilibrium  strategies  of  the 
infinitely  repeated  subjective  game.  These  core  equilibrium  strategies  are  strategies  which  result  in 
core  equilibrium  payoffs  according  to  both  players’  subjective  model  of  the  infinitely  repeated 
game.  The  following  results  characterize  these  equilibrium  payoffs. 

Theoretn  4.2:  There  is  a  unique  pair  of  payoffs  corresponding  to  core  equilibrium  strategics  in  the 
infinitely  repetited  subjective  game  when  player  1  is  privately  informed  of  the  outcome  of  Nature’s 
move.  This  pair  is  {ra  +  (l-r)c,  ra  +  (l-rjc). 

Before  proving  this  theorem,  let’s  describe  how  the  core  equilibrium  payoffs  are  determined.  First, 
the  feasible  region  of  payoffs  for  the  game  of  incomplete  information  according  to  each  player’s 
perception  is  determined  as  the  payoffs  which  are  individually  rational  for  each  player[  10].  Then, 
the  core  equilibrium  payoffs  of  each  feasible  region  is  determined  by  identifying  the  Pareto  optimal 
outcomes. 

In  order  to  develop  the  proof  of  theorem  4.2,  we  will  need  the  following  results  concerning 
infinitely  repeated  nonzero  sum  games  of  incomplete  information  [  10]. 


T^emma  4.3:  Player  2  can  limit  the  actual  and  perceived  payoff  of  player  1  to  c  by  playing  the  pure 
strategy  x  throughout  the  repeated  game.  This  strategy  is  the  Blackwell  strategy  [18]  of  player  2. 


Proof:  Sw  Appendix. 

I^mma  4.4:  According  to  player  I’s  perception,  player  1  can  limit  the  payoff  of  player  2  to  pc  + 
(l-p)d.  The  strategy  achieving  this  payoff  is  the  maxmin  strategy  for  the  following  infinitely 
repeated  zero  sum  garne  of  incomplete  information: 

Game  1 
o  t 

X  -a  -b 

[I  -c  -d 

where  Prob(Nature  chooses  game  1)  =  p. 

I  .gtntna  4.5:  According  to  player  2’s  perception  player  1  can  limit  the  payoff  of  player  2  to  qd  + 
(1-q)  c.  The  strategy  achieving  this  payoff  is  the  maxmin  strategy  for  the  infinitely  repeated  zero 
sum  game  of  incomplete  information  of  lemma  4.4,  where 
Prob(Nature  chooses  Game  1)  =  q. 

Lemmas  4.4  and  4  J  are  direct  consequences  of  the  definition  of  the  threat  strategy  and  the  results 
of  zero  sum  infinitely  repeated  games  of  incomplete  infOTmation  in  [9]. 

Lemma  4.6:  According  to  player  I's  perception,  the  payoff  {pa  +  (l-p)c,  pa  +  (l-p)c}  is  an 
individually-rational  equilibrium  payoff.  AccOTding  to  player  2's  perception,  the  payoff 
(qa  +  (l-q)c,  qa  +  (l*q)c}  is  an  individually-rational  equilibrium  payoff. 

Proof:  Individual  rationality  follows  from  Lemmas  4.3  and  4.5.  The  above  payoff  is  achieved  by 
the  following  strategy:  At  the  first  stage  of  the  game  player  l(the  informed  player )  plays  X  if 
Nature  selects  Game  1  and  p  if  Nature  selects  Game  2,  thereby  revealing  Nature's  move  to  player 
2.  Player  2  can  play  any  strategy  at  stage  1 .  For  all  subsequent  stages,  if  Nature  chose  Game  1 , 
the  players  choose  (X,o).  If  Nature  chose  Game  2,  the  players  choose  (p,x).  If  either  player 
deviates,  the  other  player  implements  his  threat  strategy  of  lemma  4.3  or  4.4.  Note  that  neither 


Gaine2 
o  X 

X  -b  -a 

p  -d  -c 


player  1  nor  player  2  can  improve  his  payoff  by  deviating  from  the  announced  strategy,  since  the 
threat  stiate^  yields  a  lower  payoff  than  the  equilibrium  for  the  deviating  player;  consequently,  the 
strategies  described  above  result  in  an  equilibrium,  according  to  each  player’s  perception  of  the 
game.  The  values  achieved  by  these  strategies  are  the  values  postulated  in  the  Lemma.  q.e.d. 

Lemma  4.7:  According  to  player  I’s  perception,  the  paytrff  pa  +  (l-p)c  is  the  most  player  1  can 
achieve  in  the  infinitely  repeated  game.  According  to  player  2‘s  perception,  the  payoff  qa  -t-  (l^)c 
is  the  most  player  1  can  achieve  in  the  inflnitely  repeated  game. 

Proof:  See  appendix 

Proof  of  Theorem  4.2: 

From  Lemmas  4.6  and  4.7,  the  equilibrium  payoffs  of  Lenuna  4.6  are  individually  rational  and 
Pareto  optimal  according  to  both  players'  perceptions.  From  lemma  4.7,  these  equilibrium 
payoffs  are  the  unique  payoflfs  in  the  core  of  the  game  according  to  each  player’s  perception.  In 
ortkr  to  achieve  these  payoffs,  player  1  must  reveal  his  private  information  concerning  Nature's 
mOve.  Hence,  all  core  equilibrium  strategies  for  player  1  must  reveal  Nature's  move  in  a  finite 
number  of  repetitions.  Once  Nature's  move  is  common  knowledge,  each  player  can  use  his 
dominant  strategy  in  the  appropriate  Game,  so  that  all  core  equilibrium  strategies  result  in  the 
unique  equilibriuih  payoffs  for  the  infinitely-repeated  game,  q.e.d. 

Corollary  4.8:  Assume  both  players  follow  the  core  equilibrium  strategies  (fescribed  in  the  proof  of 
lemma  4.6.  Then,  the  players  do  not  realize  that  they  have  different  subjective  models. 

Proof:  As  lOTg  as  neither  player  deviates  from  his  equilibrium  strategy,  they  cannot  discover  the 
differences  in  their  model,  since  these  strategies  are  core  equilibrium  strategies.  Suppose  player  1 
deviates  from  his  equilibrium  strategy.  Such  a  deviadon  is  immediately  detected  by  player  2,  who 
switches  to  his  threat  strategy  x  as  described  in  lemma  4.3.  This  strategy  is  independent  of  the 
value  of  q,  hence  the  players  are  unable  to  detect  any  difference  in  their  subjective  models. 

Suppose  player  2  deviates  from  is  equilibrium  strategy.  Such  a  deviation  is  immediately  detected 
by  player  1,  who  switches  to  his  threat  strategy  as  described  in  lemma  4.4  This  strategy  is 
described  in  (191  as  follows:  A  set  of  lotteries  is  available  to  player  1.  Depending  on  the  outcome 
of  Nature's  move,  a  lottery  is  chosen  by  player  I.  This  lottery  is  performed  and  its  outcome 
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determines  the  move  of  player  1  for  the  remainder  of  the  game.  In  order  for  tlie  two  players  to 
realize  that  they  have  different  models,  the  move  of  player  1  must  have  zero  probability  according 
to  the  model  of  player  2.  Using  Uie  results  of  [19],  the  maxmin  strategy  of  player  1  is:  According 
to  player  I’s  perception,  player  1  performs  eitb^  a  lottery  whose  outcome  is  X  with  probability  1  or 
a  lottery  whose  outcome  is  X  with  probability  1/2.  According  to  player  2's  perception,  player  1 
performs  either  a  lottery  whose  outctnne  is  p  with  probability  1  or  a  lottery  whose  outcome  is  X 
with  probability  1/2.  Therefore,  even  when  player  1  switches  to  his  threat  strategy,  the  players 
never  realize  that  their  subjective  tmdels  are  different  because  there  is  no  contradictory  behavior. 
qx.d. 

Corollary  4.9:  For  the  game  of  theorem  42,  the  value  of  private  information  is  positive  for  player 

1. 

Proof:  The  value  of  private  information  for  player  1  can  be  computed  explicitly  using  theorem  4.2. 
Since  pa  +  (l-p)b  is  the  only  core  equi-  librium  of  the  inflnitcly-iepeated  game  without  private 
informatioa  for  either  i^er,  the  expected  value  oi  information  to  player  1  is  the  diffe-  rence 
between  the  payoff  of  Theorem  4.2  arxl  this  payoff.  This  value  is 

vU(i-p)(c-b)  >0. 

Note  also  that 

v2-(l-q)<c-b)  >  0, 

so  the  private  information  of  player  1  also  has  positive  value  for  player  2.  q.e.d 

In  sum,  we  have  answered  many  of  the  questions  asked  at  the  beginning  of  this  subsection. 
Specifically,  repetition  of  the  game  serves  to  alleviate  the  differences  in  the  models  of  the  players 
and  leads  to  eventual  agreement  on  a  common  pair  of  strategies,  and  the  value  of  private 
information  is  positive  for  the  informed  player,  unlike  the  results  for  the  static  game  of  the  previous 
section.  Note  that,  due  to  the  symmetry  in  the  game,  a  similar  set  of  lemmas  and  theorems  can  be 
established  if  player  2  were  the  informed  player.  When  one  player  has  private  information,  the 
perceived  core  of  the  game  for  each  player  has  a  single  pair  of  equilibrium  payoffs,  and  repetition 
of  the  game  serves  to  eliminate  the  differences  in  initial  probability  assessments,  in  contrast  with  the 


situation  when  iitither  pSaycr  has  private  information,  where  the  perceived  core  of  the  game  for 
placer  2  can  ixxiuin  many  additional  equilibrium  payors.  The  selection  of  equilibrium  strategies 
in  this  case  iS  tlvf  result  cf  bargaining  between  the  two  players  to  decide  which  pair  of  core 
equilibrium  pa,  they  will  achieve.  In  the  next  subsection,  we  examine  the  effect  of  various 
bargaining  thcc»if*s  -n  the  outcome  of  the  infinitely  repeated  subjective  game  without  private 
information. 

4.3  The  effect  of  Bar^air.ing  Tneories  on  the  Outcome  of  a  Subjective  Infinitely  Repeated  Game  of 

Incomplete  InfaTOian 

In  tins  section,  we  investigate  the  effect  of  the  bargaiiung  theories  of  Nash  [21]  and  2^uthen 
[22,23]  on  the  outcome  of  the  mfinitely  repeated  game  of  subsection  4.1.  The  core  equilibrium 
payoffs  of  the  game  according  to  player  I's  perception  are  the  single  point  (pa  +  (l-p)b,  pa  + 

( 1  -p)b).  According  to  player  2's  perception,  the  core  equilibrium  payoffs  of  the  game  are  the  pairs 
of  points  on  the  boundary  of  the  shaded  regioii  of  fig.  4-2.  We  show  that,  depending  on  the 
bargaiiung  model  used  by  the  players,  they  may  or  may  not  realize  the  differeiice  in  their  models 
during  the  course  of  bargaining. 

Theorem  4.10:  Consider  the  bargaining  problem  for  the  infinitely  repeated  game  of  Section  4.1.  If 
the  Nash  bargaining  model  with  moves  consisting  of  the  choice  of  strategies  is  used,  the  players 
will  agree  on  the  core  equilibrium  strategy  and  never  realize  the  differences  in  their  models. 

Proof:  Nash's  bargaining  modei  is  a  game  in  normal  form  where  each  plsyer  has  only  one  move. 

In  this  move,  the  playCr  announces  a  strategy  which  achieves  the  payoffs  corresponding  to  the 
Nash  bargaining  solution,  satisfying  Nash's  axioms  [21].  The  maxmin  values  which  each  player 
can  guarantee  himself  are  (pd  +  (l-p)c,  pd  +  (l-p)c)  according  to  player  I’s  perception,  and  (qc  + 
(l-q)d,  qc  +  (l-q)d)  according  to  player  2's  perception.  In  either  case,  the  maxmin  values  lie  on 
the  line  x=y,  as  do  points  B  and  B'  in  figures  4-1  and  4-2.  Hence,  the  payoffs  corresponding  to 
Nash's  bargaining  solution  correspt^  td  to  pwints  B  and  B'  in  these  figures.  The  strategies  which 
achieve  these  rayoffs  are  the  pair  of  strategies  (X,o),  acetwding  to  each  player's  perception.  Thus, 
the  players  will  agree  on  the  strategies  announced  and  never  realize  the  difference  in  their 
underlying  perceptions  of  the  game. 


In  the  above  result,  players  only  exchanged  their  final  bargaining  strategies.  When  utility  is 
lineariy  transferable  between  the  players,  Nash's  bargaining  model  with  threats  can  be  used.  In 
this  model,  players  exvAange  threat  strategies,  then  select  the  bargaining  solution  based  on  these 
threat  strategies.  The  players  can  de  tect  the  inconsistency  between  their  models  if  the  threat 
strategies  announced  are  inconsistent  with  the  players'  models. 

Theorem  4.11:  Assume  that  players  use  Nash's  bargaining  model  with  threats  and  transferable 
utilities[28].  If 

Pi  (a.-dt  b-cl  «  p* 

a-d 

dien  the  players  do  not  detect  that  they  have  different  perceptions  of  the  game.  If  p>p*,  the  players 
detect  an  inconsistency  in  their  perceptions  of  the  game. 

Proof:  In  Nash's  bargaining  model  with  threats  and  transferable  utib‘ties[28],  the  threat  strategies 
for  the  players,  according  to  player  I's  perception,  are  the  solutions  of  the  zero  sum  game 


<J  X 

X  0  p{c-b)  +  (l-p)(d-a) 

p  -p'c-b)  +  (l-p)(d-a)  0 

If  p  <  p*,  the  optimal  pair  of  threat  strategies  is  (p,x).  If  p>  p*,  the  optimal  pair  of  threat  strategies 
is  (X,o).  According  to  player  2,  since  q  <  1/2,  the  optimal  pair  of  threat  strategies  is  (p,T).  Hence, 
if  p>p*,  the  players  will  detect  an  inconsistency  in  the  announced  threat  strategies,  If  p<p*,  the 
threat  strategies  announced  by  each  player  will  be  consistent,  and  result  in  expected  payoffs  which 
are  on  the  x=y  line.  Hence,  the  Nash  bargaining  solution  will  be  the  same  as  the  previous  theorem, 
and  th'  players  will  not  detect  the  difference  in  their  perceptions  of  the  game. 

Other  bargaining  models  can  lead  to  players  discovering  the  inconsistencies  in  their  models. 
Consider  Zeuthen's  regular  bargaining  model  122)-(23|,  which  is  a  game  in  extensive  form  which 
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allows  bargaining  to  proceed  in  steps.  We  assume  again  that  the  moves  of  players  are  strategies 
corresponding  to  the  players'  payoff  demands.  Then,  if  the  players  do  not  reach  an  agreement  in 
one  move,  it  is  possible  that  player  2  (whose  perpeption  of  the  game  is  described  by  figure  4-2) 
may  propose  a  move  which  player  1  cannot  interpret  within  the  terms  of  his  own  model  (e.g.  player 
2  may  propose  a  move  corresponding  to  a  payoff  which  is  in  the  core  of  his  own  game  but  not  in 
the  set  o(  equilibria  of  the  game  perceived  by  player  1).  At  that  point,  player  1  has  a  different 
model  of  the  game  since,  by  assumption,  he  excludes  the  possibility  of  an  irrational  opponent 

Theracms  4.10  aixl  4.1 1  illustrate  the  role  of  the  bargaining  model  on  the  outctnne  of  the 
subjective  game.  The  outcome  o(  the  infinitely  rq^eated  subjective  game  depends  on  the  number  of 
steps  required  to  reach  an  agreement  If  the  model  pedicts  that  agreement  is  reached  in  one  step, 
each  player  may  not  have  the  opportunity  to  realize  that  the  intended  meaning  of  the  move  of  the 
other  player  was  different  from  what  he  perceived  it  to  be.  On  the  other  hand,  if  the  model  predicts 
that  more  than  one  step  may  be  required  to  reach  an  agreement,  then  the  players  may  have  the 
(^pominity  to  realize  that  their  subjective  perceptions  of  the  game  are  different 


SECTIONS.  CONCLUSIONS 


In  this  repot,  we  formulated  a  class  of  "subjective  games,"  where  the  players  have  different 
perceptions  of  the  rules  of  the  game  and  are  unaware  of  the  differences  in  their  perceptions.  This 
class  of  games  is  a  generalization  of  the  team  problem  of  asymptotic  agreement  studied  in  [14].  We 
devel<^;(ed  a  conceptual  and  analytical  framework  for  studying  the  effects  of  these  differences  in 
perception  on  the  strategies  used  by  the  players.  By  studying  in  detail  a  specific  class  of  symmetric 
games  of  incomplete  information,  we  showed  that  the  properties  of  these  "subjective  games"  are 
different  from  the  properties  of  similar  Bayesian  games.  Specifically,  many  features  of  the 
Bayesian  games,  such  as  the  positive  value  of  private  information  in  symmetric  games,  are  not 
maintained  when  the  players'  perceptions  of  the  game  are  allowed  to  differ. 

An  important  issue  which  arose  from  our  formulation  was  whether  the  players  discover  that 
their  perceptions  are  different  during  the  play  of  the  ganre.  We  showed  that,  in  a  static  game, 
{dayers  (rfiten  discover  at  the  end  the  game  that  they  have  different  perceptions.  Infinite  repetition 
of  the  game,  however,  may  alleviate  the  differences  in  the  players'  models,  and  lead  to  strategies 
where  the  players  do  not  discover  that  they  had  different  initial  perceptions.  In  addition,  we 
showed  that,  in  an  infinitely-repeated  game,  agreement  on  a  core  equilibrium  strategy  depends  on 
the  bargaining  model  adopted  by  the  players  in  the  game. 

The  rudimentary  investigation  reported  here  needs  to  be  carried  further.  It  is  important  to 
Characterize  the  classes  of  subjective  games  where  the  rational  strategies  are  insensitive  to  the 
differences  in  the  perceptions  of  the  players.  In  addition,  our  analysis  should  be  extenrted  beyond 
the  point  where  the  two  players  reach  an  impasse.  To  proceed  further,  it  may  be  appropriate  to 
ftamulate  the  comTict  situation  as  a  bargaining  problem  which  is  perceived  differently  by  each 
player.  Fot  such  a  bargaining  problem,  the  players  seek  strategies  which  belong  to  the  core  of  all 
games.  The  bargaining  theories  of  Harsanyi  [23],  Hearns  [25],  Kalai-Owen-Maschler  [24],  Owen 
[26]  and  Zeuthen  [22]  may  prove  useful  in  determining  such  strategies  and  could  be  the  starting 
point  for  further  investigation. 
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APPENDIX 


Proof  of  Leirjna  4.3: 


To  determine  the  payoff  to  which  player  1  can  be  limited  by  player  2,  it  suffices  to  determine  the 
approachable  set  for  player  2  when  he  tries  to  minimize  player  I's  payoff.  We  define  the  following 
payoff  matrix 


Ia,bl 

Ec/i] 

[d,cl 

The  first  (second)  component  of  each  entry  of  G  gives  the  payoff  to  player  1  when  game  1  (2)  is 
played.  These  payoffs  are  shown  in  figure  A-1.  Ihe  approachable  set  for  player  2  is  the  shaded 
area  of  payoffs  of  fig.  A-1  [9].  A  point  in  this  area  is  guaranteed  by  using  the  pure  strategy  t, 
which  is  the  Blackwell  strategy  [9]  fer  player  1  When  player  2  uses  this  strategy,  player  I’s 
payoff  cannot  exceed  c.  q.e.d. 

Proof  of  Lemma  4,7: 

The  maximum  payoff  which  player  1  can  hope  to  achieve  is  an  equilibrium  payoff  which  is 
individually  rational  for  player  2.  To  determine  this  payoff,  ^ve  define  the  payoff  matrix 


Isa+(l-s)b,  sa+(l-s)b] 

(sb+(l-s)a,  sc+(l-s)d] 

and  let  s  vary  from  0  to  1 .  For 

0^  s  ^  b)„ 

(c-b  +  a-d) 


we  obtain 


Isc+(l-s)d,  sb+(l-s)a] 
{sd+(l-s)c,  sd+(l-s)cj 

(A-l) 
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5b+(l-s)a  i  sd+(l-s)c  21  sa+(l-s)b  2.  sc+(l-s)d. 


(A-2) 


Rom  Lemma  4.5,  the  individually-rational  payoff  for  the  game  with  probability  s  is  sd  +  (1-s)  c. 
As  illustrated  in  figure  A-2,  the  maximum  payoff  for  player  1  when  A-1  holds  is  (l-s)c  +  sd. 

For  s  in  the  range 


we  get 


(c  r  bi  i  s  i  1/2 
(c-b  +  a-d) 

t 

sb4-(l-s>a  i  sa-f(l-s)b  i  sd+(l-s)c  ^  sc-i(l-s). 


Rgure  A-3  illustrates  the  maximum  payoff  which  can  be  obtained  by  player  1.  After  some  algebra, 
this  payoff  can  be  determined  as 


s(a-b+d-c)  +b  -  d 

For  s  >  1/2,  the  situation  is  illustrated  in  fig.  A-4.  In  this  case,  the  maximum  payoff  which  player 
1  can  achieve  is  sa -I- (1-s)  b. 

According  to  [9],  [20],  the  maximum  payoff  player  1  can  achieve  as  s  increases  from  0  to  1  is 
given  by  the  least  concave  function  which  majorizes  the  payoff  fuiKtion  as  s  varies  from  0  to  1.  As 
illustrated  in  fig.  A-S,  this  function  is  described  by 

sa  +  (l-s)c.  (A-6) 


Consequently,  the  maximum  payoff  player  1  can  achieve  when  s  =  p,  according  to  his  perception, 
is  given  by  pa  +  (1-p)  c.  The  maximum  payoff  he  can  achieve  according  to  player  2's  perception  is 
given  by  qa  + (l-q)c.  q.e.d. 
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PAYOFF  OF  PLAYER  1 


Rgure  4-1.  Individually  Rational  Payoffs  According  to  Player  1,  p>p’ 


Figure  4-2.  Individually  Rational  Payoffs  According  to  Player  1,  p<D 


PAYOFF  OF  PLAYER  1 


Figure  4-3,  Individually  Rational  Payoffs  According  to  Player  2,  q  <  1  -  p 


qe+(l-q)(j,qb+(l-q)»l 


[qa+(1-q)b,qa+(l-q')bl 

Inks 


[qd+(l-q)c,qd+(l-q)cl 


lqb+(  1  -q)a,qc+(  1  -q)d] 


/  / 


PAYOFF  OF  PL  AYER  2 


PAYOFF  OF  GAK'lE  1 .  PLAYER  1 


Hgure  A-1:  Payoff  to  Which  Player  1  can  be  Limited  by  Player  2 


Figure  A-2.  Maximum  Payoff  of  Player  1  when  0  <  s  <  !  -  p*. 


27 


PAY0FF0FPIAYER2  ^  I  PAYOFF  OF  PLAYER  2 


;sc+(1-s)d,sb+(1*s)a] 


PAYOFF  OF  PLAYER  1 


iure  A-3.  Maximum  Payoff  of  Player  1  when  1  -  p*  i  s  ^  1/2 


Isa+(1-s)b,sa+{1-s)b] 


PAYOFF  OF  PLAYER  1 


Figure  A-4.  Maximum  Payoff  of  Player  1  when  1/2  ^  s  1. 
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